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Preface 


The  International  Conference  on  Control  and  Estimation  of  Distributed  Parameter 
Systems  took  place  from  July  14-20, 1996,  at  the  Bildungshaus  Chorherrenstift  Vorau 
in  Vorau  (Austria).  It  was  the  seventh  in  a  series  of  conferences  that  begun  in  1982. 
51  researchers  from  11  states  contributed  to  draw  a  broad  and  diverse  picture  of 
the  recent  developments  in  optimal  control  and  parameter  identification  of  partial 
differential  equations,  both  from  a  theoretical  and  numerical  viewpoint.  We  thank 
them  all  for  their  contributions  to  an  enjoyable  and  interesting  conference. 

We  address  our  thanks  to  the  whole  staff  of  the  Bildungshaus  Chorherrenstift 
Vorau.  The  pleasant  atmosphere  at  the  Bildungshaus  has  been  a  key  ingredient  to 
the  success  of  the  meeting  and  the  stimulating  interaction  between  the  participants. 
We  are  particularly  indebted  to  Mrs.  L.  Reifi,  who  helped  us  omnipresently  with  all 
the  everyday  issues  of  a  conference  like  this  one. 

This  meeting  was  facilitated  by  funding  from  the  following  organizations: 

Amt  der  Steiermarkischen  Landesregierung, 

Bundesministerium  fur  Wissenschaft  und  Verkehr, 

Christian  Doppler  Laboratorium  fur  Parameter  Identifikation 
und  Inverse  Probleme, 

European  Research  Office  of  the  U.S.  Army, 

Spezialforschungsbereich  F003  “Optimierung  und  Kontrolle” , 

Stadt  Graz, 

U.S.  Air  Force  European  Office  of  Aerospace  Research  and  Development. 

It  is  our  pleasure  to  acknowledge  the  generous  support  by  these  institutions. 

Once  again,  the  friendly  and  supportive  team  of  Birkhauser,  in  particular 
Dr.  T.  Hintermann  and  Mrs.  S.  Lotrovsky,  have  provided  an  optimal  opportunity 
to  publish  our  proceedings.  Our  special  thanks  go  to  Mrs.  G.  Krois.  Her  enthusiasm, 
skill  and  workpower  have  been  the  backbone  of  the  organization  of  the  conference  and 
the  preparation  of  the  T^X  manuscript  of  the  proceedings  you  are  presently  reading. 


Graz,  July  1997 
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Approximation  Results  for  Parameter  Estimation 
in  Nonlinear  Elastomers 
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Abstract.  In  this  paper  we  present  an  approximation  framework  and  theoretical  conver¬ 
gence  results  for  a  class  of  parameter  estimation  problems  for  general  abstract  nonlinear 
hyperbolic  systems.  These  systems  include  as  a  special  case  those  modeling  a  large  class  of 
nonlinear  elastomers. 
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1.  Introduction 


We  consider  the  following  class  of  abstract  nonlinear  damped  parameter  dependent 
hyperbolic  systems  evolving  in  a  complex  separable  Hilbert  space  H : 

(1.1) 

wtt  +  Ai(q)w 

+  A2(q)wt  +  Af*g{q){J\fw)  =  f(t;  q) 

(1.2) 

w(0)  =  ip0 

(1.3) 

wt(  0)  =  tfi. 

Here  Ai{q),A2{q)  are  unbounded  operators  depending  on  some  parameter  q.  g(q )  is 
a  parameter  dependent  nonlinear  operator  in  H,  fif  is  an  unbounded  operator,  and 
/  is  a  parameter  dependent  forcing  term.  Precise  conditions  on  these  operators  are 
given  below. 

This  class  of  systems  was  introduced  in  [BGS,  BLMY]  and  further  studied  in 
[BLGMY]  as  a  model  for  the  behavior  of  nonlinear  elastomers.  These  materials, 
which  are  used  in  the  development  of  active  and  passive  vibration  devices,  are  rub¬ 
ber  or  polymer  based  composites  that  involve  complex  viscoelastic  materials.  Their 
behavior  cannot  be  adequately  modelled  using  the  theory  of  linear  elasticity.  Indeed, 
they  exhibit  nonlinearities  in  material  and  geometric  properties  so  that  there  is  a 
nonlinear  relationship  between  stress  and  strain  even  for  small  strains.  We  illustrate 
with  a  simple  example  that  takes  into  account  these  nonlinearities,  and  describe  the 
associated  general  parameter  estimation  problems.  (For  detailed  discussions  of  this 
and  other  models  see  [BLMY,  BGS,  BLGMY,  BL].) 
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Consider  an  isotropic,  incompressible  rubber-like  rod  under  simple  elongation  with 
a  finite  applied  stress  in  the  principal  axis  direction  aq  =  x.  Let  w  denote  deformation 
in  the  x  direction.  Following  the  derivation  of  the  model  in  [BLMY,  BGS,  BL]  we 
arrive  at  the  partial  differential  equation 


(1.4) 


d2w  d  (EAdw\  d  (EA_(dw\\ 
^  dt 2  dx  \  3  dx )  dx  \  3  ^  \  dx ) ) 


where  p  is  mass  density,  E  is  the  generalized  modulus  of  elasticity,  A  is  the  cross 
sectional  area,  and  F  is  an  applied  external  force.  If  one  assumes  that  the  rod  is 
composed  of  a  neo-Hookean  material  (see  [BL]),  then  the  nonlinearity  g  in  (1.4)  is 
given  by  g(£)  =  1  —  for  —  1  <  £  <  1.  Assuming  that  we  have  a  slender 

rod  of  length  £  that  satisfies  w(t,0)  =  w(t,£)  =  0,  and  defining  V  =  Hq(0,£)  and 
H  =  L2(0,£),  we  obtain  the  usual  Gelfand  triple  V  *— >  H  «  H*  c-»  V*  where 
V*  =  Then  equation  (1.4)  with  the  specified  boundary  conditions  can  be 

written  in  the  variational  form: 


(1.5)  pAwtt  +  Aiw  +  D*g(Dw)  =  F  in  V*, 
where  A\  €  C(V ,  V*)  is  given  by 

(Ai<p,ip)v,v  =  (^Dcp,Dip^ 

and  D  —  £  C{V ,  H)  is  the  spatial  differentiation  operator.  For  a  realistic  model 

we  also  must  include  some  type  of  material  damping  which  is  known  to  be  present 
in  elastomers  (indeed  in  all  materials).  Here  we  assume  an  internal  damping  of  the 
form  A2wt,  where  A2  €  jC(V2,  F>*)  and  V  <-*■  V2  H.  In  the  case  of  Kelvin- Voigt 
damping  we  define  V2  —  V  =  iLo(Oi^)  an(^ 

{A2lp,tp)vf,V2  =  (cdD(p,DiP)h  , 

where  Cd  6  Loo(0,  £)■  (We  remark  that  the  exact  form  of  the  damping  mechanism  in 
elastomers  is  not  known  and,  indeed,  is  the  subject  of  current  research.)  With  the 
damping  included,  we  find  that  our  model  in  variational  form  for  the  neo-Hookean 
elastomer  rod  is  given  by 

(1.6)  pAwtt  +  Aiw  +  A2wt  +  D*g(Dw)  =  F  in  V*. 

If  this  model  is  to  be  used  for  simulation  or  control  of  the  behavior  of  the  elastomer 
rod  we  need  values  for  p,  E,  A,  cD,  F,L  Some  of  these  can  be  given  or  measured  ex¬ 
plicitly  (e.g.,  A,£,  F),  or  can  be  found  from  manufacturers  specifications  (so-called 
“book-values”).  However,  some  parameters  (e.g.,  E,  cp)  cannot  be  measured  or  ob¬ 
tained  this  way.  Also,  the  “book-values”  can  vary  considerably  between  samples. 
Thus  we  need  a  method  to  estimate  these  “unknown”  parameters  by  dynamic  exper¬ 
iments  with  the  sample  itself.  Moreover,  the  nonlinearity  g  is  in  general  unknown 
and  must  be  estimated  (the  neo-Hookean  assumption  is  only  a  first  approximation  to 
actual  material  properties)  or  chosen  from  a  general  class  of  admissible  nonlinearities. 

In  one  general  parameter  estimation  formulation  equation  (1.6)  takes  the  form 
(1.1)— (1.3)  where  the  structural  operators  Ai,A2,  the  nonlinearity  g  and  the  input 
/  have  all  been  parameterized  by  a  vector  (possibly  infinite  dimensional)  parameter 
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q  that  must  be  estimated.  Here  the  parameter  q  takes  values  from  an  admissible 
parameter  set  Q.  Suppose  that  we  have  a  set  of  measured  observations  z  =  {zi}f=1 
corresponding  to  measurements  (e.g.,  displacements,  velocities)  taken  at  time  tj.  In 
a  general  least  squares  parameter  estimation  problem,  we  seek  to  minimize  the  least 
squares  output  functional 

J(q,z)  =  \C2{Ci{w{ti,-\q)}  -  {zi}^ , 

over  q  £  Q,  where  {w(U,  •;  q)}  are  the  parameter  dependent  solutions  of  (1.1)— (1.3) 
evaluated  at  time  =  1,2,  ...K,  and  |  •  |  is  an  appropriately  chosen  Euclidean 
norm.  The  operators  C\,C2  depend  on  the  type  of  the  collected  data.  For  example, 
if  Zi  is  time  domain  displacement,  velocity  or  acceleration  at  a  point  x,  then  C\ 
involves  differentiation  (0,1  or  2  times,  respectively)  with  respect  to  time  followed 
by  pointwise  evaluation  in  t  and  x.  The  operator  C 2  is  the  identity  in  the  case  of 
time  domain  identification,  while  it  is  related  to  the  Fourier  transform  if  we  consider 
fitting  the  data  in  the  frequency  domain  (see  Chapter  5  of  [BSW]  for  details). 

In  this  formulation  the  minimization  problem  involves  an  infinite  dimensional  state 
space  and  (in  general)  an  infinite  dimensional  admissible  parameter  set  Q.  To  over¬ 
come  this  difficulty  and  to  obtain  a  computationally  tractable  method,  we  use  the 
general  ideas  described  in  [BSW].  Namely,  let  HN  be  finite  dimensional  subspaces  of 
H,  and  QM  be  a  sequence  of  finite  dimensional  sets  approximating  the  parameter  set 
Q.  Denote  the  orthogonal  projections  of  H  onto  HN  by  PN.  One  can  formulate  a 
family  of  approximating  estimation  problems  with  finite  dimensional  state  spaces  and 
finite  dimensional  parameter  sets  in  the  following  way:  find  q  £  QM  which  minimizes 

(1.7)  JN{q,z)  =  \C2{Ci{wN{tu-]q)}  -  {zi}}^  , 

where  wN{t\  q)  £  HN  is  the  solution  to  the  finite  dimensional  approximation  of  (1.1)- 
(1.3)  given  by: 

(!-8)  (w%,4>)v*y  +  (Ai{q)wN  ,<t>}v*y  +  (A2w?  ,<t>)v*y2  +  (g(q)(J\fwN),  N<j>) 

=  (f(t',q),<t>)v'y2 

(1.9)  wN(0)  =  PN<p0,  w?( 0)  =  PN^ 

for  all  if  €  HN . 

Solution  of  these  approximate  estimation  problems  (1.7)— (1.9)  provides  one  with 
a  sequence  of  parameter  estimates  {qN,M}.  The  crucial  question  is  when  one  can 
guarantee  that  this  sequence  (or  some  subsequence)  converges  to  a  solution  of  the 
original  infinite  dimensional  parameter  estimation  problem.  Under  certain  suitable 
assumptions  on  the  approximating  spaces  Hh  and  approximating  sets  QM  this  ques¬ 
tion  is  answered  in  [BSW]  for  linear  systems  and  here  we  extend  these  ideas  to  include 
nonlinear  systems. 

To  permit  use  of  the  method  outlined  above  we  must  be  certain  that  the  above 
systems  (1.1)— (1.3)  and  (1.8)— (1.9)  have  solutions  in  some  sense  for  each  q  G  Q. 
This  well-posedness  problem  (without  considering  the  parameter  dependent  case) 
was  solved  in  the  recent  paper  [BGS],  In  the  following  section  we  summarize  these 
results  and  give  precise  conditions  under  which  (1.1)— (1.3)  has  a  unique  weak  solution 
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for  each  q  e  Q.  Then  in  Section  3  we  give  assumptions  on  the  general  parameter 
estimation  problem  that  we  shall  use  to  prove  convergence.  We  also  recall  Theorem 
5.1  from  [BSW]  that  provides  a  sufficient  condition  for  the  convergence  of  the  solutions 
{qN,M}  of  the  approximate  estimation  problems  to  a  solution  of  the  original  parameter 
estimation  problem.  Then  in  Section  4  we  show  that  this  condition  is  satisfied  in  our 
case  under  natural  assumptions  on  the  parameter  dependence  of  A\,A2 ,g  and  /. 

2.  Formulation  of  the  Problem 

We  assume  that  there  is  a  sequence  of  separable  Hilbert  spaces  V,  V2,  H,  V*,  Vf 
forming  a  Gelfand  quintuple  [BIW,  Wl]  satisfying 

(2.1)  V^v2^H^V2*^  V, 

where  we  assume  that  the  embedding  V  w  V2  is  dense  and  continuous  with  \y\v2< 
c\tp\v  for  ip  £  V  and  V2  H  is  a  dense  compact  embedding  with  \<p\  <  c\q>\v2 ■  The 
norm  in  H  will  be  denoted  by  |  - 1  while  those  in  V,  V2  etc.  will  carry  an  appropriate 
subscript.  We  denote  by  (  ,  )v\v,  etc.,  the  usual  duality  products  [Wl].  These 
duality  products  are  the  extensions  by  continuity  of  the  inner  product  in  H ,  denoted 
by  (  ,  )  throughout.  Let  Q  be  an  infinite  dimensional  parameter  set.  The  operators 
Ai(q)  and  A2{q)  are  defined  in  terms  of  their  sesquilinear  forms  cri(q)  :  V  x  V  — > 
C  and  a2(q)  :  V2  X  V2  ->  C.  That  is,  Ai(q)  G  C(V,V*),A2(q)  G  C{V2,V2*)  and 

(Ai(q)tp,ip)v,v  =  0i(?)(¥hVO>  ‘4>)v^,v2  = 

Let  Ct  denote  the  space  of  functions  w  :  [0,  T\  — >  H  such  that 

w€Cw([0,T},V2)nLx([0,T},V) 

(the  subscript  W  denotes  weak  continuity),  and 

wteCw([0,T],H)nL2([0,T},V2), 

where  the  time  derivative  wt  is  understood  in  the  sense  of  distributions  with  values 
in  a  Hilbert  Space  (see,  e.g.,  [Lil]).  The  space  CT  is  equipped  with  the  norm 

(2.2)  \w\or  =  ess  sup  {\wt(t)\  +  \w(t)\v)  +  (  [  \wt{t)\y2dt 

te[o,T]  V"'0 

Definition  2.1.  We  say  that  w  G  Ct  is  a  weak  solution  of  the  problem  (1.1)  -(1.3) 
if  it  satisfies  the  equation: 

jf*  [  -  (tflrW,  Vt(t))  +  <7 !(q)  (■ w(t ),  77(r))  +  a2(q)  ( wt{t ),  ij(r)) 

(2.3)  +(g(q)(J\fw{T)),N'r](T))  dr  +  (wt(t),  rj{t)) 

=  (tpur)(0)}  +  Jo  ( f{T;q),v(T))V*,v2dT , 
for  any  t  €  [0,  T]  and  any  q  G  CT,  as  well  as  the  initial  condition 

(2.4)  w(  0)  =  <p0. 
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Equivalently,  w  is  a  weak  solution  if 

(2.5)  (wtt,ri)v*y+(T1(q)(w,ri)+cr2(q)(wt,v)  +  (g(q)(Nw),N'ri)  =  {f{q),v)V2*,v2 

is  satisfied  for  all  g  G  Ct  and  for  almost  all  t  G  [0,  T}. 

To  establish  our  parameter  estimation  convergence  results,  we  first  make  the  fol¬ 
lowing  assumptions  (these  assumptions  are  the  same  as  in  [BGS]  except  that  here 
we  require  them  to  be  satisfied  uniformly  for  all  q  G  Q)  which  will  guarantee  well- 
posedness  for  all  q  £  Q. 

Al)  The  form  a^q)  is  a  Hermitian  sesquilinear  form:  for  tp,ip  G  V 

(2.6)  =(Ti(q){il},<p)  for  every  q  G  Q. 

A2)  The  form  <j\ (q)  is  V  bounded:  for  ip,4>  G  V 

(2.7)  \ai(q)((p,ip)  <  Ci\ip\v\ip\v  for  every  q  G  Q. 

A3)  The  form  ai(q)  is  strictly  V  coercive:  for  <p  G  V 

(2.8)  Re <7! (q) (tp,<p)  =  0i(q)(<p,  <p)  >  h\<fi\v>  h>0 
for  every  q  G  Q. 

A4)  The  form  a 2(q)  is  V2  bounded:  for  <p,r/>  e  V2 

(2.9)  \cr2(q)((p,ip)\  <c2\(p\vMv2  for  every  q  G  Q. 

A5)  The  real  part  of  a2(q)  is  V2  coercive  and  is  symmetric: 

(2.10)  Recr2(q)((p,<p)  +  \0\<p\2  >  k2\<p\y2  k2>  0,  Ao>0 

(2.11)  Rea2(q)(<p,ip)  =  Reo2(q)('if,<p),  for  any  ip,if  eV2,q  G  Q. 

A6)  The  forcing  term  f(q)  satisfies  /  G  T2([0, T],  14,*)  for  every  q  G  Q. 

A7)  The  operator  M  satisfies 

(2.12)  M  G  £( V2,  H)  with  \Nip\  <  \flz  \y\v2 

and  the  range  of  N  on  V  is  dense  in  H. 

Note  that  (2.12)  and  V  ^  V2  implies 

(2.13)  J\fe£{V,H)  with  \Af <p\  <  Vk  \(p\v 
with  k  =  c2k. 

A8)  The  nonlinear  function  g(q)  :  H  ->  H  is  a  continuous  nonlinear  mapping  of  real 
gradient  (or  potential)  type.  This  means  that  there  exists  a  continuous  Frechet- 
differentiable  nonlinear  functional  G(q)  :  H  ^  M1,  whose  R-echet  derivative 
G'(q)(ip)  G  C(H,  E1)  at  any  ip  G  H  can  be  represented  in  the  form 

(2.14)  G'(q)(<p)ip  =  Re(g(q)(ip),'f)  for  any  ipeH. 
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We  also  require  that  there  are  constants  C\,  C2,  C3  and  e  >  0  such  that 

(2.15)  -  \k-\h  -  e)\g>\ 2  -  Ci  <  G(q)(<p)  <  C2\<p\2  +  C3, 

for  every  q  €  Q,  where  k  is  from  (2.13)  and  k\  from  (2.8). 

A9)  The  nonlinear  function  g(q)  also  satisfies 

(2.16)  \g{q){p>)\  <  Ci\ip\  +  C2,  (peH, 

for  every  q  G  Q,  for  some  constants  C\,  C2. 

An  additional  condition  is  necessary  for  uniqueness  of  solutions. 

A10)  For  any  <p  G  H  the  Frechet  derivative  of  g(q)  exists  and  satisfies 

(2.17)  g'(q)(g>)  e  C(H,H)  with  \g'(q)(<p)\C(H,H)  <C3  for  every  q  e  Q. 

All)  We  assume  that  for  any  u,  v  6  CT,  the  following  inequality  is  satisfied  for  any 
t€  [0 ,T],qeQ: 

Jq  ^Re{g(q)(Afu(T))  -  g{q){Afv(r)),Afu(T)  -A Iv{t)) 

(2.18)  +  /c1fc“1|A/'«(r)  -  Wn(r)|2|  dt 

+  °((i 

where  a(£)  >  0  is  a  continuous  function  in  £  >  0  such  that 

i)  o(0)  =  0, 

ii)  there  exists  a  first  derivative  such  that  a'(0)  =  0. 

Note  that  (2.18)  is  satisfied  if,  for  example, 

(2.19)  Re(g(q)(<p)  -  g(q)(i>),  p>  -  ip)  +  fcifc-1|<^  -  ip\2  >  0 

for  any  ip,  ip  €  H,  where  k  and  k\  are  the  constants  in  (2.8)  and  (2.13).  Thus 
if  AT  =  L2{Q),  Q,  C  Rm,  so  that  g(q)  :  R  — >  M,  then  a  sufficient  condition  for 

(2.19)  is  that  g'(q)(£)  >  —l\  for  some  li  >  0. 

In  [BGS]  it  is  shown  that 

Theorem  2.1.  Under  conditions  A1)-A11)  the  system  (1A)-(1.3)  has  a  unique  weak 
solution  w  G  Ct  for  every  initial  condition  (y?o>¥h)  €  V  x  H.  The  weak  solution 
satisfies 

(2.20)  {wtt,r]}v^v+ai{q){w,r])+a2(q)(wt,r])  +  {g{q){Afw),Afg)  =  (f(q),g}v2\v2 

for  all  T)  e  CT,q  6  Q  and  for  almost  all  t  £  [0,  T].  Also,  w  £  CV([0,  T],  V2),  wt  G 
Cw([0,T],  H)  and  the  weak  solution  depends  continuously  on  initial  conditions. 
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3.  The  General  Parameter  Estimation  Problem 

Assume  that  we  have  a  set  of  observations  z  =  {zi}f=1  corresponding  to  measure¬ 
ments  taken  at  time  U.  As  stated  in  the  Introduction  we  would  like  to  find  a  solution 
for  the  least  squares  minimization  problem,  i.e.,  find  q  6  Q  that  minimizes 

(3.1)  J(q,z)  =  | C2  {Ci{w(ti,-;g)}  -  {z*}}|2, 

where  {w(tit  •;  g)}  are  the  parameter  dependent  solutions  of  (1. 1)— (1 .3)  evaluated  at 
time  ti,i  =  1,2, ...  K.  To  this  end,  we  consider  Galerkin  type  approximations  to 

(1.1) — (1.3)  and  define  a  family  of  approximating  parameter  estimation  problems. 

Let  HN  be  finite  dimensional  subspaces  of  H  and  let  QM  be  finite  dimensional  sets 

approximating  (in  a  sense  to  be  made  precise  below)  the  parameter  set  Q.  Let  PN 
denote  the  orthogonal  projections  of  H  onto  HN .  Then  the  approximate  parameter 
estimation  problems  can  be  stated  in  the  following  way:  find  q  E  QM  that  minimizes 

(3.2)  JN{q,z)  =  \C2{C\{wN(tir]q)}  -  {^}}|2, 

where  wN(t]  q)  G  HN  is  the  solution  to  the  finite  dimensional  approximation  of  (1.1)- 

(1.3)  given  by  (1-8)— (1.9). 

We  make  the  following  assumptions  for  the  spaces  HN  and  H  and  the  sets  QM 
and  Q  (see  [BSW]). 

Bl)  The  sets  Q  and  QM  lie  in  a  metric  space  Q  with  metric  d.  We  assume  that  Q  and 
QM  are  compact  in  this  metric  and  there  is  a  mapping  iM  :  Q  — >  QM  such  that 
QM  =  iM{Q).  Also,  for  each  q  €  Q,  iM(q)  — *  q  in  Q  with  the  convergence 
uniform  in  q  G  Q. 

B2)  The  finite  dimensional  subspaces  HN  satisfy  HN  C  V. 

B3)  For  each  ip  EV,  \ ip  -  PNip\v  — »  0  as  N  — >  oo. 

B4)  For  each  ip  E  V2,  \ip  —  PNip\v2  — >  0  as  N  — >•  oo. 

We  also  assume  that  Ai,A2,g,f  depend  continuously  on  the  parameter  q  E  Q, 
i.e.,  they  satisfy  the  following  conditions: 

Cl)  Wi{q)(<p,ip)  -  cn{q){<P,ip)\  <  lid(q,q)\<p\v\ip\v,  for  every  cp,ipEV. 

C2)  \cr2{q)(£,r])  -  a2(q)(t,v)\  <  lf2d{q,q)\^\v2\v\v2,  for  every  £,t?  G  V2. 

C3)  \g(q)((p)  -  g(q)(<P)\  <  nf3d{q,q)\(p\  for  all  <p  E  H. 

C4)  The  mapping  q  —>/(•;  q)  is  continuous  from  Q  to  L2([0,  T\,  Vf). 

Under  conditions  A1)-A11),  Bl),  Cl)-C4)  we  know  that  a  solution  {qN,M}  to 
the  approximate  parameter  estimation  problem  (1.7)— (1.9)  and  a  solution  q  to  the 
original  parameter  estimation  problem  for  (3.1)  exist.  A  general  sufficient  condition 
for  the  convergence  of  {qN,M}  to  q  is  given  in  Theorem  5.1  of  [BSW]  (see  also  [BK]): 

Theorem  3.1.  To  obtain  convergence  of  at  least  a  subsequence  of  {qN,M}  to  a  solution 
q  of  minimizing  (3.1)  subject  to  (\.1)-(1.3),  it  suffices,  under  assumption  Bl),  to 
argue  that  for  arbitrary  sequences  {qN,M}  in  QM  with  qN<M  —>  q  £  Q,  we  have 

(3.3)  C2C\wN(t-  qN,M)  -*  C2C\w(t;  q). 
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4.  Convergence  Results 

In  this  section  we  show  that  under  our  general  conditions  given  above,  the  con¬ 
vergence  criteria  (3.3)  of  Theorem  3.1  holds  for  a  reasonable  class  of  observation 
operators  Ci,C2. 

Theorem  4.1.  Suppose  that  assumptions  Al)-All),  Bl)-B4)  and  Cl)-C4)  are  sat¬ 
isfied.  Let  qN  be  arbitrary  in  QN  such  that  qN  — >  q  £  Q  as  N  — >  oo.  Then  we 
have 

wN(t,qN )  — >  w(t,q)  in  V,  for  allt>  0 


and 

wfi(t,  qN)  — ►  wt(t,  q )  in  H  for  all  t  >  0  and  in  V2  for  almost  all  t>  0, 
where  wN ,  wfi  satisfy 

{w^t(t),f))v.y  +  o2(qN)(wfi{t),4>)  +  a1(qN)(wN(t),4>)  +  (g(qN)(AfwN),Affi) 

(4.1)  ww(0)  =  PNv o  ,  <(0)  =  PNV i 

for  all  (f>  6  HN ,  and  w,wt  satisfy 

(wtt(t),  <f>)v*,v  +  cr2(<?)(u’i(0i  0)  +  ( q){w(t ),  <f>)  +  {g{q) (Afw) ,  Af<j>) 

=  (f(t,  q),(f>)v2\v2 

(4.2)  w(0)  =  ,  wt(0)  =  <pi 

for  all  (f>  £  V. 

Proof:  We  know  that  w(t)  €  V,  wt(t)  €  H  for  all  t  >  0  and  wfit)  €  for  almost  all 
t>  0.  By  the  triangle  inequality 

| wN{t,  qN )  -  w(t,  q)\v  <  \uiN(t,  qN )  -  PNw(t,  q)\v  +  \PNw{t,  q)  -  w(t,  q) \v. 

By  assumption  B3)  the  second  term  on  the  right  side  goes  to  0  as  iV  — *  oo.  So  to 
prove  our  statement  about  wN(t ,  qN)  it  is  enough  to  show  that  the  first  term  on  the 
right  side  also  goes  to  0  as  N  — »  oo.  Similarly, 

\w?(t,qN)  -wt{t,q)\v2  <  | wfi(t,qN)  -  PNwt(t,q)\V2  +  \PNwt{t,q)  -  wt{t,q)\v2. 

The  last  term  again  goes  to  0  by  B4),  so  to  prove  our  statement  it  is  enough  to  show 
that  the  first  term  also  converges  to  zero. 

Let  us  introduce  the  following  notation: 


wN  =  wN(t,  qN),  w  =  w(t,  q )  and  A"  =  wN(t,  qN)  -  PNw(t ,  q). 
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Then 

Af  =  wn  _  ~PNw  =  wf  -  PNwt, 
Af  =  WN  _  f_pNw 

it  tt  dt2 

since  uut  G  L2([ 0,  t],  V2). 

Prom  (4.1)  and  (4.2)  we  have  that  for  every  ip  G  HN: 


(A u,Tp)v*,v  =  (w%  -  wtt  +  wtt  -  -^PNw,ip)vy 

(4  3)  =  ~  CJ^qN^wt^  ~  ffi iQN)(wN,ip) 

-  {g{qN)AfwN ,Mip)  -  (. f{q),ip)v*,v2  +  o2{q){wuip)  +  ax {q){w,ip) 

d? 

+  (g(q)-^w,ATip)  +  (wa  -  jpPNw,ip)v,y. 

By  adding  and  subtracting  we  obtain  for  all  ip  G  Hh 

(Af,^)y.y  +  <*i(qN)(^N ,V>)  =  (wtt-  ^pPNW,1p)v,V 

-  (/(«)-  f(qN),i>)v2*y2 

(4-4)  +  a2(qN)(wt  -  PNwt,ip)  +  <j2{q){wt,ip)  -  a2(qN)(wt,ip ) 

+  <7i {qN)(w  -  PNw,  ip)  +  cti (q){w,  ip)  -  a1(qN)(w,  ip) 

-  a2(qN)(A?  ,ip)  +  {g{q)Mw,ATip)  -  {g{qN)AfwN ,ATip). 

We  may  choose  ip  =  Af  since  Af  G  HN.  Then  (Af,  A ?)v\v  —  |;||Af|y.  As  in 
[BSW]  we  find: 

|  (<|Af|^  +u1(^)(AiVJ  Aff))  =  2i?e{<in„  -  ^Ptfw,A?)v.y 

-  (f{q)  -  f{qN),A?)v-y2 

(4-5)  +  a2(qN)(wt  -  PNwt,  Af )  +  a2(g)(wt,  Af )  -  a2{qN){wt,  Af ) 

+  cti ($*)(«>  -  Af)  +  oi(g)K  Af )  -  cri  ($*)(«;,  Af ) 

-  a2(qN)(  Af ,  Af)  +  {g{q)Nw,NA”)  -  (g(qN)ATwN ,  AT  A? )}. 

We  denote  the  left  side  of  (4.5)  by  L(t)  and  the  right  side  by  R(t).  Integrating  L 
from  0  to  t,  using  initial  conditions 

A*(0)  =  to*(0)  -  PNw{ 0)  =  to"(0)  -  PNipo  =  0 


and 

Af  (0)  =  tof(0)  -  PNwt{ 0)  =  inf  (0)  -  PNVl  =  0, 
along  with  A3),  we  have 


(4.6) 


[  L(s)ds  >  | Af  \v  +  ki\AN\y 
Jo 
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We  next  argue  that 

(4.7)  f  R(s)ds  <  fij)^ (t)  +  /12  /  +  ki\AN\yds, 

JO  JO 

where  6N(t)  — *  0  as  N  — >  oo  and  /ri,p,2  are  positive  constants.  Then  by  Gronwall’s 
inequality  we  obtain  that 

ki\AN(t)\y  +  |Af  (t)\y  — >  0  as  N  —>  oo  for  each  t>  0, 

which  implies  the  desired  results. 

Proceeding  as  in  [BSW] ,  we  have  that 

[l  R(s)ds  <  v\8N(t)  +  Vi  [l  |Af|v  +  kx\AN\lds 

(4.8)  Jo  .  Jo 

+  2  Re  j  ((g(q)Afw,AfA?)  -  (g(qN)AfwN ,AfA? ))  ds, 

where 

6N(t)  =  jf  i?e  {K  -  AjV.v  +  |/(<?)  -  /(^)l^ 

(4-9)  +  c^|rot  -  FV&  +  lld2{q,  qN)\wt\v2 

+  c2\w  -  PNw\v  +  'Jid2{q,qN)\w\v}ds, 

and  8N(t)  ->  0  as  N  ->  oo  by  (5.18)  of  [BSW],  B3),  B4),  properties  of  w,wt  and 
assumptions  of  the  theorem.  Finally,  we  need  to  show  that  the  last  integral  containing 
the  terms  involving  g  can  also  be  estimated  from  above  by  an  expression  similar  to 
the  right  side  of  (4.7).  We  may  argue 

|  f{g{q)NwMA?)  -  (g(qN)AfwN,MA?)ds\ 

Jo 

<  \f*(g(q)Mw-g(qN)Mw,MA?)  +  (g(qN)Afw-g(qN)AfwN,AfA?)ds\ 

(4.10)  <  \  (  (nf3<P(q><lN)k\w\v  +  k\A?\v)ds 
2i  J  o 

+  I  f{g{qN)Uw  -  g{qN)NPNw,M^)ds\ 

Jo 

+  !  [\g(qNWPNw  -  g(qNWwN:MA?)ds\. 

Jo 


Now  the  first  integral  on  the  right  is  dominated  by  the  right  side  of  (4.7)  (with  suitably 
chosen  constants).  To  estimate  the  last  two  integrals  we  use  the  same  method  as  in 
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[BGS].  We  have 

I  f  {g{qN)Mw  -  g{qN)MPNw,M^)ds\ 

Jo 

=  |  /(/ g'(9J\fw(s)+(l  — 9)AfPNw(s))[Afw(s)  —  AfPNw(s)]d9,J\f  A^)ds\ 

(4.11)  Jy° 

<  [  C3\Afw-AfPNw\\AfA?\ds<^  [ (ClWw-AfPNw\2+\NA?\2)ds 
Jo  2  Jo 

<\k  [  Cl\w  —  PNw\yds +  \k  (  |Af|yds. 

2  Jo  2  J  o 

Here  the  first  term  in  the  last  expression  goes  to  0  as  N  — ►  oo  by  B3)  and  properties  of 
w,  while  the  second  is  dominated  by  second  term  on  the  right  side  of  (4.7).  Similarly, 

I  f\g(qN)MPNw  -  g{qN)MwN ,MA?)ds\ 

(4.12)  J\  t  t 

<\kj  C2\PNw  -  wN\l  +  |Af  1 2vds  <  ifc  [  C2\AN\2V  +  |Af  1 2vds, 

2  Jo  2  J  o 

which  again  is  dominated  by  the  right  side  of  (4.7).  This  completes  the  required 
arguments.  □ 

We  note  that  the  above  theorem  gives  a  computationally  tractable  method  to 
solve  the  parameter  estimation  problem  involving  (3.1)  in  case  the  data  collected 
consists  of  displacement  or  velocity  measurements,  i.e.,  C\  is  either  the  identity  or 
differentiation  with  respect  to  time  once  followed  by  evaluation  in  t  and  x.  However, 
the  case  of  accelerometer  data  is  more  complicated,  since  then  Theorem  3.1  requires 
(t]  qN)  — >  wtt(t;q )  in  V*  for  t  £  [0,T].  We  will  now  give  conditions  under  which 
this  convergence  can  be  obtained. 

Let  us  suppose  that  V2  =  V,  i.e.,  we  have  strong  damping,  such  as  Kelvin- Voigt 
damping  in  the  example  given  in  the  Introduction.  We  can  formulate  the  system 
(1.1)— (1.3)  in  variational  form  (4.2)  and  rewrite  it  in  first  order  vector  form  on  H  = 
V  x  H  in  the  coordinates 


We  define  V  =  V  x  V  and  a  (q)  :  V  x  V  ->  C  by 

°-(q)  ((  l  )’  (  t  ))  =  “(’h ^  +  CTl (?)(£> ^)  +  f72(g)07,V’)- 

Then  (4.2)  can  be  rewritten  as 

(4.13)  (zt,  $)  +  a{q)(z,  <L)  =  (F(g),  4>)  for  all  <f>  e  V 

(4.14)  z(0)  = 

F ^  ~  (  f(q)  -Af*g (q){Nzi) 


where 
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We  can  also  write  this  in  the  equivalent  operator  form  (not  distinguishing  between 
row  and  column  vectors): 

(4.15)  zt  =  A(q)z  +  F(q) 


(4.16) 


where  er(g)(<f>,  lb)  =  (— A(g)<E>,  \t)v*,v  with 

,,  ,  _  [  0  I 

A[q)  ~  [  -AM  ~A2(q)  ■ 

It  is  shown  in  [BIW,  BSW]  that  if  V  =  V2,  then  A(q)  generates  an  analytic  semigroup 
S(t;  q)  on  V*  =  V  x  V*.  Then  the  weak  solution  of  (1.1)— (1.3)  can  be  represented  in 
the  form: 

(4.17)  *(‘U)  =  ( 

Letting  TtN  =  HN  x  HN ,  we  can  restrict  <r(g)  to  7iN  x  HN ,  denote  this  restriction 
by  aN  and  define  AN(q )  :  HN  ->  HN  by  (AN(q)$,  $)  =  (JAr($,  T)  for  all  ^  G 
Hn  x  Hn.  Then  AN  generates  an  analytic  semigroup  SN(t ;  q)  on  HN.  Solutions  of 
(1.8)— (1.9)  can  then  be  represented  as: 


(4.18)  ^.)-(^j|)- 


=  SN(t-,q) 


PNV  o 

PN ip. 


+  [tSN(t-r-,q)PNF(r-,q)dT, 
Jo 


where  PNF  is  understood  to  mean 


PNF i 
PnF2 


if  Fi  are  the  components  of  F  and 


pN  .  y*  -»  Hn  is  the  generalized  projection  (in  the  sense  of  the  duality  product). 
We  can  then  use  the  theory  developed  in  [BR]  to  obtain  z^(t;qN)  — >  zt{t\q )  when 
qN  — t-  q.  Note  that  this  will  guarantee  that  w^t{t\qN)  — >  q)  in  V*  for  t  £  [0,T] 
(which  is  what  we  desired).  According  to  Theorem  3.1,  3.2  in  [BR]  and  the  remarks 
following  them,  this  convergence  is  guaranteed  if  we  can  argue  convergence  of  (4.18) 
to  (4.17)  after  differentiation  of  these  terms  with  respect  to  time.  Since  we  have 
analytic  semigroups  we  obtain  this  property  if  F(q)  £  Z°°([0,T],  W)  (i.e.,  pointwise 
defined  and  bounded  V*  valued  functions)  and  PNip  — >  in  V*.  Thus,  we  can  state 


the  following  theorem: 


Theorem  4.2.  Let  V  =  V2  and  f(q)  £  L°°([0,T],  V*)  in  the  system  (l.l)-(l.Z).  Let 
A1)-A11),  Bl)-B4),  Cl)-C4)  hold.  Morover,  assume  that  PNip  — >  ip  in  V*  for 
(p  £  V*.  Then  for  any  qN  — >  q  £  Q  we  have  w^(t;qN)  —>  wa{t;q)  in  V*  for 
t  £  [0, T}. 


Proof:  Using  the  arguments  in  [BR]  with  (4.17),  4.18  we  only  need  to  argue  that 
f{--,q)  -  Af*g(q)(Afz1)  £  Z°°([0,  T],  V*).  But  this  follows  from  the  fact  that  /(g)  £ 
L°°([0,T],W),  z  =  w  £  CV([0,T],U2)  and  J\f  £  C(V2,  H).  □ 
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Preconditioners  for  Karush-Kuhn-Tucker  Matrices 
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Abstract.  In  this  paper  preconditioners  for  linear  systems  arising  in  interior-point  meth¬ 
ods  for  the  solution  of  distributed  control  problems  are  derived  and  analyzed.  The  matrices 
K  in  these  systems  have  a  block  structure  with  blocks  obtained  from  the  discretization  of 
the  objective  function  and  the  governing  differential  equation.  The  preconditioners  have  a 
block  structure  with  blocks  being  composed  of  preconditioners  for  the  subblocks  of  the  sys¬ 
tem  matrix  K.  The  effectiveness  of  the  preconditioners  is  analyzed  and  numerical  examples 
for  an  elliptic  model  problem  are  shown. 

1991  Mathematics  Subject  Classification.  49M30,  49N10,  90C06,  90C20 

Key  words  and  phrases.  Preconditioners,  iterative  methods,  interior  point  methods,  linear 
quadratic  optimal  control  problems. 


1.  Introduction 

The  discretization  of  distributed  linear  quadratic  optimal  control  problems  with 
bound  constraints  on  the  controls  and  on  the  states  leads  to  large  scale  quadratic 
programming  problems.  Because  of  their  complexity  and  convergence  properties, 
interior  point  methods  are  attractive  solvers  for  such  problems.  They  are  iterative 
methods  which  in  each  iteration  generate  approximations  to  solutions  that  are  strictly 
feasible  with  respect  to  the  bound  constraints.  Within  each  iteration,  large  indefi¬ 
nite  linear  systems  have  to  be  solved.  If  interior  point  methods  are  applied  to  linear 
quadratic  control  problems  governed  by  partial  differential  equations,  then  iterative 
techniques  usually  have  to  be  applied  to  solve  these  linear  systems.  To  make  inte¬ 
rior  point  methods  efficient,  it  is  important  to  solve  these  linear  systems  efficiently. 
Krylov  subspace  methods  are  iterative  linear  system  solvers,  which  are  very  suitable 
in  this  context.  They  do  not  require  the  system  matrix  in  explicit  form,  but  only 
require  matrix  vector  multiplications.  This  is  very  useful  since  for  the  problems  under 
investigation  the  system  matrices  have  a  block  structure  in  which  blocks  are  related 
to  discretized  differential  equations.  The  convergence  of  Krylov  subspace  methods 
depends  on  the  distribution  of  the  eigenvalues  of  the  system  matrix.  Roughly  speak- 


*This  author  was  supported  by  the  NSF  DMS-9403699,  AFOSR  F49620-93-1-0280,  and  in  part 
by  the  DoE  DE-FG03-95ER25257. 


Preceding  Page  Blank 


16 


Preconditioners  for  Karush-Kuhn-Tucker  Matrices 


ing,  their  convergence  is  the  better  the  more  the  eigenvalues  of  the  system  matrix 
are  clustered  and  the  smaller  the  clusters  are.  Ill-conditioning  of  the  matrix,  i.e.  a 
large  quotient  of  largest  absolute  eigenvalue  divided  by  smallest  absolute  eigenvalue, 
typically  corresponds  to  a  poor  convergence  of  Krylov  subspace  methods.  To  improve 
the  convergence  of  these  methods  nonsingular  matrices  are  constructed  so  that  the 
similarity  transformation  with  these  matrices  leads  to  a  system  with  better  clustered 
eigenvalues.  These  matrices  are  called  preconditioners.  The  purpose  of  this  paper  is 
the  construction  of  such  preconditioners  for  systems  arising  in  interior-point  methods 
for  certain  distributed  control  problems. 

To  illustrate  the  issues,  we  consider  the  following  elliptic  model  problem. 


mm  - 
2 


-»(*))’  + 1  j£„  «’(*)<<» 


over  all  (y,  u)  satisfying  the  state  equation 
,,  -A y(x)  +  y{x)  =  f(x) 

^ '  £y(x)  =  u(x ) 


iesi, 

x  £  dfl 


and  the  bound  constraints 


Diow  ^  yipi)  —  Vupp  a.e., 
ttiow  ~  u(x)  ^  'U'upp 

A  discretization  of  the  problem  with,  say,  finite  elements,  leads  to  a  quadratic  pro¬ 
gramming  problem  of  the  form 


min  ^ ylMyyh  +  ^ vThMuuh  +  cTyh  +  <fuh 


subject  to 


(1.5)  Ayh  +  Buh  =  6, 

Vh,low  —  Vh  —  Vhptppi 

(1.6) 

'U’hjow  —  'U'h  —  /U'h,upp' 

Here  h  indicates  the  mesh  size  of  the  discretization  and  uh  £  M"“ ,  yu  G  R"v  represent 
the  discretized  controls  and  states,  respectively.  The  matrices  Mu  £  M™"x"u  and 
My  £  Rn*'xn»  are  positive  definite.  The  vectors  yh,iow ,  ■  ■  ■  ,Uh,uPp  are  obtained  from 
the  bound  constraints  (1.3)  in  a  straightforward  way. 

There  are  various  classes  of  interior  point  methods.  They  all  (after  possible  trans¬ 
formations)  require  the  solution  of  linear  systems  with  system  matrices 

(  Hy  0  AT  \ 

(1.7)  K  =  0  Hu  BT  , 

\  A  B  0  J 

where 


Hy  =  My  +  Dy,  Hu  =  7M„  +  Du, 
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with  some  positive  semidefinite  diagonal  matrices  Dy  and  Du.  Since  the  matrix  K 
is  related  to  matrices  arising  in  the  Karush-Kuhn-Tucker  optimality  conditions,  we 
call  K  a  Karush-Kuhn-Tucker  (KKT)  matrix. 

Even  though  the  exact  form  of  the  diagonal  matrices  Dy  and  Du  differs  from  interior 
point  method  to  interior  point  method,  they  all  have  in  common  that  diagonals  of 
Dy  and  Du  grow  unbounded  if  the  corresponding  components  of  yh  or  Uh  converge 
towards  a  bound. 

The  matrices  K  arising  in  interior-point  methods  for  the  solution  of  problems  like 
(1.1)— (1.3)  are  usually  ill-conditioned.  There  are  at  least  two  sources  for  the  ill- 
conditioning.  One  source  is  the  discretization  of  the  infinite  dimensional  problem. 
Typically,  the  eigenvalues  of  K  spread  out  towards  zero  if  the  discretization  is  refined. 
The  second  source  are  the  large  diagonals  in  Dy  and  Du  that  arise  if  variables  ap¬ 
proach  the  bound.  This  source  is  due  to  the  interior-point  method.  Ill-conditioning 
also  arises  if  the  original  infinite  dimensional  problem  is  ill-posed.  The  precondi¬ 
tioners  derived  in  this  paper  are  designed  to  remedy  the  ill-conditioning  arising  from 
the  first  two  sources.  They  use  the  block  structure  of  K  and  are  composed  of  pre¬ 
conditioners  for  the  blocks  My,  Mu,  and  A  of  K.  This  allows  the  use  of  known 
preconditioners  for  the  governing  differential  equations.  Moreover,  computationally 
expensive  parts  of  the  preconditioner  have  to  be  computed  only  once  during  the 
interior-point  method,  since  only  the  diagonal  contributions  Dy  and  Du  change  from 
one  interior-point  iteration  to  another. 

Preconditioners  for  problems  related  to  this  one  are  investigated  in  other  papers. 
There  are  several  papers,  e.g.  [5],  [15],  [17],  investigating  preconditioners  for  systems 
arising  in  the  numerical  solution  of  partial  differential  equations  such  as  the  Stokes 
equations,  or  the  biharmonic  equation.  These  systems  can  also  be  viewed  as  KKT 
systems.  However,  the  blocks  in  those  matrices  are  different  and,  therefore,  the 
preconditioners  for  those  problems  are  different  than  the  ones  introduced  here.  In 
fact,  if  the  governing  equations  would  be  the  Stokes  equations,  or  the  biharmonic 
equation,  then  the  preconditioners  in  the  papers  cited  above  could  be  used  as  blocks 
in  the  preconditioners  introduced  here.  Some  of  the  tools  provided  in  those  papers, 
in  particular  a  result  from  [15],  cf.  Lemma  5.1,  are  heavily  used  in  our  analysis. 
Preconditioners  for  interior-point  methods  for  linear  programs  (LP)  are  investigated 
in  [9],  [10].  Those  preconditioners  are  for  general  LPs  and  are  based  on  sparse  matrix 
factorizations  or  on  the  SOR  method.  Since  no  particular  structure  is  assumed,  those 
papers  do  not  contain  any  theoretical  result  on  the  quality  of  the  preconditioner. 

This  paper  is  organized  as  follows.  In  the  first  part  we  study  the  QP  problem.  Sec¬ 
tion  2  investigates  the  problem  (1.1)— (1.3)  and  its  discretization.  The  Sections  3  and  4 
discuss  the  optimality  conditions  for  the  QP  (1.4)-(1.6)  and  some  aspects  of  interior- 
point  methods  relevant  for  the  construction  of  preconditioners.  Section  5  contains 
some  essential  results  about  the  Krylov  subspace  methods  MINRES  and  SYMMLQ. 
The  preconditioners  are  introduced  and  analyzed  in  Section  6.  This  section  also 
contains  some  numerical  tests  demonstrating  the  quality  of  the  preconditioners. 
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2.  The  Control  Problem 

As  noted  in  the  introduction,  one  source  of  ill-conditioning  in  the  KKT  matrix 
is  the  discretization  of  the  infinite  dimensional  problem.  This  section  provides  some 
results  needed  to  address  this  aspect  of  the  problem.  These  results  can  be  proven  for 
a  general  class  of  problems,  which  include  the  model  problem  (1.1)— (1.3)  as  a  special 
case.  In  this  section  we  do  not  consider  the  control  or  the  state  constraints. 

2.1.  The  Abstract  Problem.  Let  y  and  U  be  Hilbert  spaces.  These  spaces  play  the 
role  of  the  state  and  the  control  space,  respectively.  Moreover,  let  a,  b  be  continuous 
bilinear  forms  on  T  x  y  and  U  x  T,  respectively.  In  addition,  we  assume  that  a  is 
3^— elliptic.  In  particular,  there  exist  constants  a  >  0  and  j3  >  0  with 

at\\y\\y  <  a(y,y),  b(u,y)  <  f3\\u\\u\\y\\y,  Vy  G  34  ueU. 

Furthermore,  let  2  be  a  Hilbert  space  and  C  G  C(y,2).  In  particular  there  exists 
(  >  0  such  that 

\\Cy\\z  <  ClMly  Vy  G  y. 

With  some  linear  functional  l  on  y  we  consider  the  problem 

(2.1)  min  ^\\Cy  -  zd\\%  +  ||M|£, 

(2.2)  s.t.  a(y,v)  +  b(u,v)  =  l(v)  Vv  G  y. 

Results  on  the  existence  of  solutions  for  problems  like  (2.1),  (2.2)  are  given  e.g.  in 
[1],  [13]  and  we  refer  to  those  books. 

We  consider  the  following  discretizations.  Let 

34  =  span]>i,...  ,(f>ny}  C  y,  Z4  =  span{yfi,...  ,tpnJ  CM, 

and  define  matrices  A  G  MnvX"y  and  B  G  R"yXn“  by 

Aij  o((^j,  <^j),  i,j  1,  .  .  .  ,  Tlyi 

Bij  =  b(lpj,  <pi),  j  —  1,  .  .  •  ,  Tlu,  i  =  1,  .  .  .  ,  Tly , 

and  matrices  My  G  MnyXn«  and  Mu  G  Rn“xn“  by 

( 1  by ) ij  =  (C^C^Z,  i,j  1,  .  .  .  ,  Uy, 

( Aly ) ij  —  LJ  1,  .  .  .  jTlu. 

Obviously, 

Tly  Tlu 

yl AlyUh  —  II  yh,jC<Pi\\zi  uhMuuh  —  ||  Uh.jlpiWlf 

i= 1  i= 1 

In  particular,  the  matrix  Mu  is  positive  definite  and  the  matrix  Mv  is  positive  semidef- 
inite. 

By  ||  •  ||  we  denote  the  Euclidean  norm  in  for  some  k.  We  can  show  the  following 
simple,  but  important  result. 

Lemma  2.1.  There  exists  a  constant  c  >  0,  independent  of  the  discretization  param¬ 
eter  h,  such  that 

||My1/2A-1BM-1/2||  <  c. 
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Proof.  Let  %  ^  0  be  arbitrary  and  set  yh  =  A  1BMU  1^2Uh,  U] x  =  Mu  1/2Uh-  Define 
y  =  EiZiVKifa  and  u  =  Yfl=i  uhph-  By  definition  of  A  and  B,  a(y,<j>i)  =  b(u,<j>i), 
i  =  1, . . .  ,ny.  Hence, 


a\\y\\y  <  a{y,y)  =  b(u,y)  <  /?||«My||;y. 


This  implies 


||  m^a^bm-^ww 
IKII2 


< 


\\M^yh\\2  \\Cy\\% 

IKII2  INI2 

<2||yHy  =  M  ^ 

\\Ml,2uhf  Mu  o? 


□ 


2.2.  The  Model  Problem.  The  model  problem  (1.1)— (1 .3)  fits  into  the  above  frame¬ 
work,  if  we  use  the  weak  formulation  of  (1.2).  The  Hilbert  spaces  are  y  =  H1( fl), 
U  =  L2(9Q),  and  Z  =  L2(Pl).  The  bilinear  forms  and  the  functional  are  a(y,v )  = 
fnVy(x)Vv(x)  +  y(x)v(x)dx,  b(u,v )  =  -  fxiu(x)v(x)dx,  and  l(v)  =  fnf(x)v(x)dx. 
The  operator  C  is  the  imbedding  operator.  For  our  discretization  we  use  a  finite  el¬ 
ement  discretization  with  piecewise  linear  functions  over  triangles.  In  our  numerical 
experiments  we  use  U  =  (0,  l)2  and  we  construct  the  triangulation  as  follows:  The 
x-  and  y-  intervals  are  subdivided  into  dx  and  dy  subintervals.  The  resulting  rect¬ 
angles  are  subdivided  into  two  triangles  by  connecting  the  lower  left  corner  and  the 
upper  right  corner  of  the  rectangle.  Since  piecewise  linear  approximations  are  used, 
the  number  of  state  variables  is  ny  =  (dx  +  1  )(dy  +  1)  and  the  number  of  controls  is 
nu  —  2  (dx  T  dyf 


3.  The  Quadratic  Programming  Problem 


We  consider  the  following  quadratic  programming  problem  (QP)  in  standard  form: 

(3.1)  Minimi  (l)T(ZZ 
subject  to 

(3.2)  Ay  +  Bu  =  b, 

(3.3)  y  >  0,  u  >  0. 


In  this  section  the  origin  of  the  QP  is  not  important  and  we  omit  the  subscript  h. 
Moreover,  we  absorb  7  into  Muu.  The  standard  form  (3.1)— (3.3)  is  considered  to  re¬ 
duce  the  complexity  of  notation.  Using  straightforward  extensions,  bound  constraints 
of  the  form  (1.6)  can  be  handled  as  well.  Throughout  this  section  we  use  the  notation 


M  = 


Myy  MyU  \ 
MUy  Muu  )  ’ 


C  =  (A\B),  «-(j). 


We  limit  our  discussion  to  convex  problems  and  assume  that  M  is  positive  semidef- 
inite.  The  existence  of  solutions  of  the  QP  (3.1)-(3.3)  is  guaranteed  if  the  objective 
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function  is  bounded  from  below  on  the  set  of  feasible  points.  More  precisely,  we  have 
the  following  well-known  result  (e.g.  [6,  §  12.3]): 


Theorem  3.1  (Necessary  and  Sufficient  Optimality  Conditions).  If  M  is  positive 
semidefinite  and  if  q(x)  =  \xTMx  +  gTx  is  bounded  from  below  on  the  set  of  feasible 
points  {( y,u)\Ay  +  Bu  =  b,  y>0,u>0},  then  the  QP  (3.1)-(3.3)  admits  a  solution 
x* .  If  M  is  positive  definite,  the  QP  admits  a  unique  solution. 

The  vector  ( y,u )  is  a  solution  of  (3.l)-(3.3)  if  and  only  if  there  exist  p  G  Rn», 
qy  g  Rn»,  and  qu  G  Mn“  such  that  the  Karush-Kuhn-Tucker  (KKT)  conditions 


(3.4) 


A dyyV  4“  A IyUU  A  P  Qy  d, 

AIU yy  -{-  Muuu  4 ~  B  p  qu  c, 

Ay  4-  Bu  =  b, 
yTqy  +  uTqu  =  0, 

Qy  >  Qu  ^  1 

y,u  >  o 


are  satisfied. 


To  learn  more  about  the  QP  and  the  optimality  system  (3.4)  it  will  be  helpful  to 
distinguish  three  cases.  This  discussion  will  also  help  us  to  relate  the  results  in  this 
paper  to  the  results  on  the  solution  of  KKT  systems  in  interior-point  methods  for 
linear  programming  that  can  be  found  in  the  literature,  see  e.g.  [10]. 

Throughout  this  subsection  we  assume  that  A  is  nonsingular  and  that  the  QP  has 
a  solution.  As  a  consequence,  the  matrix  C  —  (A  \  B)  has  full  row  rank  and  the 
KKT  system  (3.4)  has  a  solution. 

Bound  constraints  for  u  and  y.  Let  (yt,  14)  be  a  solution  of  the  QP.  Further¬ 
more,  let  {If, . . .  ,  lkJ  and  {l\, ...  ,  lvky}  denote  the  set  of  active  indices  for  u*  and  j/», 
respectively, 

{If, ...  ,lfj  =  {i  |  («,),  =  0},  {l\, ...,  lvky)  =  {<  |  (y*)i  =  0}. 

The  Lagrange  multipliers  at  the  solution  satisfy 

(<fe)i  =  0,  lvky}  and  (qu)i  =  0,i<f{lf,...  ,lukJ. 


If  we  define  the  matrices  I(y »)  G  Mfci,xn!',  I{ufij  G  Rfcu><n"  by 


( I{y*))ij  ~ 


1  if  j  —  if, 

0  otherwise, 


and 


( I(.u*))ij  ~ 


{ 


1 

0 


if  3=  If, 

otherwise, 


then  the  KKT  conditions  (3.4)  are  equivalent  to 


(3.5) 


/ 

Myy 

Myu 

AT 

i(y*)T 

0  ^ 

(  y  \ 

(  ~c  \ 

Muy 

Muu 

BT 

0 

I{uQT 

u 

-d 

A 

B 

0 

0 

0 

p 

— 

b 

i{y*) 

0 

0 

0 

0 

Qy 

0 

V 

0 

I(uf) 

0 

0 

0  j 

\  Qu  ) 

\  0  ) 

where  qy,  qf  denote  the  Lagrange  multipliers  corresponding  to  the  active  indices. 
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Let  l  denote  the  number  of  positive  components  in  the  solution  (y*,  u*)  of  the  QP. 
The  assumption  that  A  is  nonsingular  is  not  sufficient  to  guarantee  that  the  matrix 


(3.6) 


(A  B  \ 

C  =  I(y *)  0  e  ]£("»+(**»+"«— Q)x(n»+n«) 

V  o*  /(«.)/ 


has  full  row  rank.  If  C  does  not  have  full  rank,  then  the  system  (3.5)  does  not  have 
a  unique  solution,  even  if  the  QP  has  a  unique  solution  (y*,ut).  It  is  not  difficult  to 
see  that  in  this  case  the  Lagrange  multipliers  (p,  <?“,  q“)  are  not  uniquely  determined. 

If  M  =  0,  then  the  QP  reduces  to  an  LP.  In  this  case  the  solution  of  the  optimiza¬ 
tion  problem  can  be  found  in  a  vertex  (y*,w*).  Recall  that  a  feasible  point  (y,u)  is 
called  a  vertex  if  the  columns  of  C  =  (A  \  B)  corresponding  to  the  positive  compo¬ 
nents  are  linearly  independent,  see  e.g.  [6,  §  2].  If  (y*,u*)  is  a  vertex,  at  most  ny 
components  of  (y*,  u,)  can  be  positive  and  the  columns  of  C  =  {A  \  B)  corresponding 
to  the  positive  components  of  the  vertex  (y*,  u»)  are  linearly  independent.  If  less  than 
ny  components  of  (y*,u*)  are  positive,  the  vertex  is  called  degenerate,  see  e.g.  [6,  §  2]. 
In  the  nondegenerate  case,  i.e.  if  l  =  ny  components  of  (y*,u*)  are  positive,  then  the 
matrix  C  has  full  row  rank.  In  the  degenerate  case,  however,  l  <  ny  components  of 
(y*,  ut)  are  positive.  Thus,  2  ny  +  nu-l>  ny  +  nu  and  the  matrix  C  cannot  have  full 
row  rank.  Hence,  the  solution  is  degenerate  if  and  only  if  C  does  not  have  full  row 
rank. 

Bound  constraints  for  u.  Let  (y*,u*)  be  a  solution  of  the  QP  and  suppose  that 
no  bound  constraints  are  imposed  on  y*  or  that  the  bound  constraints  for  y*  are  not 
active.  In  this  case, 


C  = 


A  B  \ 

0  I(ut)  )  ■ 


Since  A  is  nonsingular,  C  has  full  row  rank.  Therefore,  the  system  (3.4)  is  uniquely 
solvable  if  the  matrix  M  is  positive  definite  on  the  null-space  of  C. 

In  the  LP  case,  i.e.  M  =  0,  the  solution  can  be  found  in  a  vertex  (y»,  it*).  Since,  by 
assumption,  y*  >  0  and  A  is  nonsingular,  we  can  conclude  that  u *  =  0.  Consequently, 
/(«»)  G  is  the  identity  matrix.  In  the  language  of  linear  programming,  y, 

are  the  basis  variables  and  u*  are  the  nonbasis  variables.  Thus,  this  case  always 
corresponds  to  the  nondegenerate  case  in  linear  programming. 

No  bound  constraints.  If  the  bound  constraints  are  not  active,  then  the  Lagrange 
multipliers  qy  and  qu  are  zero  and  the  KKT  conditions  (3.4)  are  equivalent  to  the 
system  (3.5)  with  the  last  two  row  and  column  blocks  of  the  system  matrix  removed. 
If  the  matrix  M  is  positive  definite  on  the  null-space  of  C,  the  system  (3.4)  has  a 
unique  solution. 


4.  Interior-Point  Methods  for  the  Solution 
of  the  Quadratic  Programming  Problem 

It  is  not  the  purpose  of  this  section  to  give  an  overview  of  interior  point  methods. 
We  primarily  address  the  structure  of  the  linear  systems  arising  in  these  methods  to 
provide  the  necessary  background  for  the  construction  of  preconditioners.  Because  of 
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space  limitations,  we  focus  on  primal-dual  interior-point  methods.  However,  matrices 
with  similar  structure  also  arise  in  barrier  methods,  see  e.g.  [19]  and  [8],  and  certain 
affine-scaling  methods,  see  e.g.  [18]. 

We  continue  to  use  the  notation  of  Section  3  and  we  will  employ  the  notation 
common  in  interior  point  methods:  For  a  given  vector  x,  the  diagonal  matrix  with 
diagonal  entries  equal  to  the  entries  of  x  is  denoted  by  X.  Moreover,  e  denotes  the 
vector  of  ones,  e  =  (1, . . .  ,  1)T. 

The  construction  of  primal-dual  interior-point  methods  is  based  on  the  so-called 
perturbed  KKT  conditions  corresponding  to  (3.4),  which  are  given  by 

Mx  +  C^p  -  q  =  -g, 

(4.1)  Cx  =  b, 

XQe  =  Be, 

and  x,q  >  0,  where  6  >  0.  To  move  from  a  current  iterate  (x,p,q)  with  x,q  >  0  to 
the  next  iterate  (x  +,p+,q+),  primal-dual  Newton  interior-point  methods  compute 
the  Newton  step  (Ax,Ap,Aq)  for  the  perturbed  KKT  conditions  (4.1)  and  set 

{x+,p+,q+)  =  ( x  +  axAx,p  +  apAp,q  +  agAq ), 

where  the  step  sizes  ax,ap,aq  G  (0,1]  are  chosen  so  that  x+,q+  >  0.  Then  the 
perturbation  parameter  9  is  updated  based  on  x+q+  and  the  previous  step  is  repeated. 
We  refer  to  the  literature,  e.g.  [20]  for  details. 

The  Newton  system  for  the  perturbed  KKT  conditions  (4.1)  is  given  by 

/  M  CT  -I  \  f  Ax  \  (  Mx  +  CTp  -q  +  g  \ 

(4.2)  \  C  Ap  =  —  Cx  -  b 

V  Q  X  )\Aq  )  \  XQe  —  Be  ) 

The  nonsymmetric  system  (4.2)  can  be  reduced  to  a  symmetric  system.  If  we  use  the 
last  equation  in  (4.2)  to  eliminate  A q, 

(4.3)  A q  =  —X_1QAx  -Qe  +  0X_1e, 
then  we  arrive  at  the  system 


M  +  X~lQ  C T 


Mx  +  CTp  +  g  —  9X  1e 
Cx-b 


If  Myu  =  0 ,Muy  =  0,  the  system  (4.4)  is  of  the  form  (1.7).  As  variables  yj  or  Ui 
approach  the  bound,  i.e.  approach  zero,  large  quantities  are  added  to  the  diagonals 
(j,j)  or  respectively. 

In  actual  computations  more  care  must  be  taken  during  the  reduction  of  the  system 
(4.2)  to  avoid  cancellation  in  the  reduction  process  due  to  very  large  elements  in  X -1, 
see  e.g.  [9].  A  stable  reduction  of  the  system  (4.2)  is  discussed  in  [9].  The  unknowns 
and  the  right  hand  side  in  that  reduced  system  differ  from  those  in  (4.4).  However, 
the  system  matrix  in  the  stable  reduction  is  equal  to  the  system  matrix  in  (4.4).  For 
our  purposes  it  is  therefore  not  necessary  to  present  the  lengthier  stable  reduction 
and  we  refer  to  [9]  for  details. 
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The  influence  of  inexact  solutions  of  the  linear  systems  (4.4)  onto  the  convergence 
behavior  of  the  primal-dual  interior-point  method  and  the  control  of  the  inexactness 
is  studied  in  [4],  [12]. 

Before  we  continue,  we  briefly  discuss  the  three  cases  explored  in  Section  3. 

No  bound  constraints.  In  this  case  the  diagonal  contributions  Dy  and  Du  com¬ 
ing  from  the  interior-point  method  will  be  zero  or  close  to  zero.  Since  in  our  case 
the  matrix  M  is  positive  definite,  the  system  (3.4)  has  a  unique  solution.  The  ill- 
conditioning  in  the  matrix  K  in  this  case  is  purely  due  to  the  discretization  of  the 
infinite  dimensional  control  problem. 

Bound  constraints  for  u.  It  has  been  observed,  e.g.  [10],  that  in  the  nondegenerate 
case  the  KKT  systems  in  barrier  methods  for  linear  programming  can  be  precondi¬ 
tioned  effectively.  This  will  also  be  true  in  our  case.  If  only  bounds  on  u  are  active, 
efficient  preconditioners  can  be  constructed  for  the  problems  investigated  in  this  pa¬ 
per.  However,  in  our  applications,  ill-conditioning  also  arises  from  the  matrices  A. 
Although  proven  to  be  nonsingular,  the  matrices  A  arising  in  our  applications  have 
a  wide  spectrum  which  causes  a  large  spread  in  the  spectrum  of  the  KKT  matrix  K. 
This  will  be  investigated  in  more  detail  in  Section  6. 

Bound  constraints  for  u  and  y.  For  the  construction  of  preconditioners  in  barrier 
methods  for  linear  programming  the  degenerate  case  is  the  difficult  one.  For  example, 
the  preconditioners  discussed  in  [10]  are  far  less  effective  in  reducing  the  condition 
number  of  the  KKT  matrix  in  the  degenerate  case  than  they  are  in  the  nondegenerate 
case,  cf.  Tables  1  and  2  in  [10].  This  will  also  be  the  case  in  our  situation.  If  bounds 
are  only  imposed  on  the  controls  u,  efficient  and  rather  general  preconditioners  can 
be  derived.  However,  if  state  constraints,  i.e.  bounds  on  y,  are  present  and  active, 
then  the  QP  (1.4)— (1.6)  is  very  often  degenerate  and  the  design  of  preconditioners  is 
much  more  difficult. 


5.  Solution  of  the  Linear  System 

5.1.  MINRES  and  SYMMLQ.  Two  Krylov  subspace  methods  for  the  solution  of 
indefinite  linear  systems,  MINRES  and  SYMMLQ,  have  been  introduced  in  [14]. 
These  methods  have  been  successfully  used  for  problems  like  the  one  studied  in  this 
paper  and  are  used  for  the  solution  of  our  systems. 

We  set  x  =  {yh,Uh,Ph)T-  Suppose  the  system  to  be  solved  is  Kx  =  b.  Given  an 
initial  iterate  x0  we  set  ro  =  b  —  Kx0.  The  Krylov  subspace  JCj(K,  ro)  is  defined  by 

(5.1)  £j(Af,  r0)  =  span{r0,  Kr0, ...  ,  K'-,_1r0}. 

In  iteration  j,  j  —  0, 1, . . . ,  the  minimum  residual  method  MINRES  computes 

Xj  e  ICj(K,  r) 


such  that  Xj  solves 


“in  JRJ--MI- 

rc€/Cj(A,ro) 


In  iteration  j,  j  =  0, 1, . . . ,  SYMMLQ  computes  the  iterate 


Xj  e  JCj(K,r0) 
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such  that  Xj  solves 

(r0  —  Kxj)Tv  =  0  Vu  G  fCj(K ,  r0). 

Since  if  is  indefinite,  such  an  Xj  may  not  exist.  If  it  does  not  exist,  SYMMLQ 
generates  an  iterate  using  information  obtained  from  the  Lanczos  tridiagonalization. 
See  [14], 

The  representation  of  Krylov  subspaces  (5.1)  show  that  Xj  G  ro)  if  and  only 

if 

Xj  =  Pj-\{K )  r0, 

where  is  a  polynomial  of  degree  less  or  equal  to  j  —  1.  This  yields  an  upper 
bound  for  the  residuals  in  MINRES: 

(5.2)  ||r0  -  Kxj\\  =  ||p](K)r0||  <  min^ma^  |px(A)|  ||r0||. 

Here  A (K)  denotes  the  spectrum  of  K  and  n]  denotes  the  set  of  all  polynomials  p 
of  degree  less  or  equal  to  j  which  satisfy  p(0)  =  1.  From  (5.2)  one  can  derive  error 
estimates,  see  e.g.  [16].  For  example,  using  Chebyshev  polynomials,  one  can  show 
the  following  convergence  estimate  for  MINRES: 

/K_  i x  U/2J 

||r0  —  Kxj\\  <  2  INI, 

where  k  =  X/X  is  the  condition  number  of  K  with  A^minAeAW  |A|,  A  =  max*^#,  |A|, 
and  [j /2j  is  the  largest  integer  less  or  equal  to  j/2. 

If  the  matrix  K  has  an  unfavorable  eigenvalue  distribution,  one  constructs  a  non¬ 
singular  matrix  P  such  that  K  =  P"1A'P“:r  has  a  smaller  condition  number  and 
better  clustered  eigenvalues.  Instead  of  Kx  =  6  one  solves  the  preconditioned  system 
Kx  =  b,  where  K  =  P-1KP~T,  x  =  PTx,  and  b  =  P'  H).  Of  course,  the  precondi¬ 
tioner  P  has  to  be  constructed  so  that  matrix-vector  multiplications  with  P~l  and 
P~T  can  be  done  efficiently  and  so  that  the  eigenvalue  distribution  of  P~lKP~q  is 
improved. 

For  more  details  on  MINRES  and  SYMMLQ  we  refer  to  [14],  [2],  and  [3].  Those 
references  also  contain  some  details  of  the  implementation.  Complete  listings  of  the 
preconditioned  MINRES  and  SYMMLQ  algorithms  are  given  in  [3].  We  have  imple¬ 
mented  MINRES  and  SYMMLQ  in  Matlab.1  Recently  a  version  of  the  QMR  algo¬ 
rithm  has  been  developed  in  [9]  to  solve  symmetric  indefinite  linear  systems.  These 
allow  the  application  of  indefinite  preconditioners.  If  the  preconditioner  is  positive 
definite,  as  in  our  case,  then  this  QMR  based  method  is  equivalent  to  MINRES. 

5.2.  Eigenvalue  Estimates.  If  A  is  invertible  and  if  Hy  and  Hu  are  positive  definite, 
then  the  matrix  K  defined  by  (1.7)  has  ny  +  nu  positive  eigenvalues  and  ny  negative 
eigenvalues.  More  information  on  the  eigenvalue  distribution  of  K  is  provided  by  the 
following  result,  which  is  proven  in  [15]: 


1A  Fortran  implementation  of  SYMMLQ  written  by  M.  Saunders  is  available  from  Netlib.  See 
linalg/symmlq  at  http://www.netlib.org/linalg/index.html. 
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Lemma  5.1  (Rusten/Winther).  Suppose  that  Hy  and  Hu  are  positive  definite  and 
that  ( A  |  B)  has  rank  ny.  Let  pi  >  >  ...  >  Hny+nu  >  0  be  the  combined 

eigenvalues  of  Hy  and  Hu  and  let  ox  >  <r2  >  . . .  >  a„y  >  0  be  the  singular  values  of 
( A  |  B)T .  The  eigenvalues  X\>  ...  >  Any+nu  >  0  >  \ny+nu+i  >  ■■■>  A2  ny+nu  of  K 
obey 


(5.3) 

(5.4) 


A2ny+nu  ^ 

Xnv+nu+l  if 


(5.5) 

(5.6) 


—  Triy+nu  > 


6.  The  Preconditioners 

We  now  turn  to  the  preconditioners  for  the  matrix  K  in  (1.7).  We  assume  that 
Hy  G  M""*'1",  Hu  G  Rn“><n“  are  symmetric  positive  definite  and  that  A  G  M"1'*"5'  is 
nonsingular. 

In  the  following  Py  and  Pv  are  preconditioners  of  Hy  and  Hu,  respectively,  i.e.  Pv 
and  Pu  are  nonsingular  matrices  such  that 

(6.1)  PylHyPfT  « I,  and  P~lHuP~r « I. 

By  A~l  we  denote  an  approximate  inverse  of  A, 

(6.2) 

In  our  numerical  tests  we  use  Pu  =  [diag^u)]1/2,  Py  =  [diag(JF/j/)]1/2,  and  A  =  A. 
Since  the  diagonals  of  the  mass  matrices  Mu  and  My  are  very  good  preconditioners 
for  these  matrices,  these  choices  for  the  preconditioners  Pu,  Py  are  efficient  and  satisfy 
(61). 

In  our  computations  we  use  K  derived  from  the  model  problem  and  the  finite 
element  discretization  outlined  in  Section  2.2.  In  all  computations  we  use  dx  =  dy. 
MINRES  and  SYMMLQ  were  used  with  starting  value  x0  =  (yh,uh,ph)  =  0  and 
the  iterations  were  stopped  when  ||P_16  —  P~1KP~TXj ||  <  10-5.  We  do  not  test 
our  preconditioners  within  an  interior-point  method,  but  simulate  the  matrices  K  in 
(1.7)  that  would  arise  in  an  interior  point  method  by  adding  diagonal  matrices  Dy 
and  Du.  All  computations  are  done  in  Matlab. 

In  the  analysis  of  the  preconditioners  it  will  be  helpful  to  distinguish  four  cases. 

Case  1  (7  =  1,  Dy  =  0,  Du  =  0):  In  this  case  we  can  reduce  the  condition  number 
of  the  systems  under  consideration  considerably.  By  preconditioning  we  reduce  the 
iterations  required  by  MINRES  and  SYMMLQ  to  a  number  which  appears  to  be 
independent  of  the  grid  size. 

Case  2  (7  1,  Dy  =  0,  Du  =  0):  In  this  case,  the  spectrum  of  Hu  moves  towards 

the  origin,  and  while  the  conditioning  of  Hu  itself  is  not  changed,  the  condition  num¬ 
ber  of  K  increases  significantly.  In  this  situation,  ill-conditioning  of  K  is  induced 
by  ill-posedness  of  the  original  problem.  As  7  decreases,  the  system  with  K  be¬ 
comes  hard  to  solve,  and  for  sufficiently  small  values  of  7  MINRES  and  SYMMLQ 
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need  an  unacceptably  large  number  of  iterations.  The  performance  of  MINRES  and 
SYMMLQ  improves  on  the  preconditioned  systems. 

Case  3  (7  =  1,  Dy  =  0,  Du  ;»  I):  If  bound  constraints  for  u  are  active,  corre¬ 
sponding  diagonal  entries  in  Du  increase.  We  write  Du  3>  I  and  mean  this  to  be 
understood  component  wise.  Large  entries  in  Du  can  be  shown  to  affect  the  condi¬ 
tioning  of  the  preconditioned  system  only  to  a  moderate  amount.  In  fact,  they  can 
even  help  to  neutralize  a  small  parameter  7  or  large  entries  in  Dy.  In  this  case  our 
preconditioners  are  very  effective. 

Case  4  (7  =  1,  Dy  »  /,  Du  =  0):  This  case  corresponds  to  the  situation  where 
bound  constraints  on  y  are  active.  As  mentioned  in  Sections  3  and  4  the  solution 
may  be  degenerate  and  this  case  may  correspond  to  the  degenerate  case  in  linear 
programming.  Often,  a  large  diagonal  in  Hy  unfavorably  affects  the  performance  of 
MINRES  and  SYMMLQ  on  the  preconditioned  systems.  While  the  preconditioners 
introduced  in  the  following  lead  to  some  improvement,  their  effectiveness  in  this  case 
is  much  smaller  than  in  the  Cases  1  and  3.  We  point  out  that  in  our  applications  the 
number  ny  of  states  is  much  larger  than  the  number  nu  of  controls.  Hence  if  more 
than  nu  states  are  active  at  the  solution,  then  the  matrix  C  in  (3.6)  can  not  have  full 
row  rank.  In  our  numerical  tests  for  Case  4  we  set  Dy  —  104/.  This  simulates  the 
worst  case  in  the  sense  that  this  corresponds  to  the  case  where  all  states  approach 
the  bounds.  Our  numerical  tests  always  correspond  to  the  degenerate  case,  which  is 
the  hard  case. 


6.1.  The  First  Preconditioner.  The  first  preconditioner  is  given  by 

0  0  \ 


p-i 

1  y 


P{1  =  0  P~ 


— 1 


0 


0 


The  preconditioned  KKT  matrix  is 


(6.3)  P{lKP{T  = 
and  we  expect  that 


P^HyPyT 


PyA~1AP~T  Py  A~]  B  P~T 


0  PjA-1 


0  PylA-TATPy 

PulHuP~T  P~1BTA~T  Py 
0 


In 


(6.4) 


PffKPf-  =  0 

V  Ira„ 


0  Iny_  \ 

in,  Pu1BTA-TPy  , 
PjAXB  PUT  0  ) 


where  /  is  an  approximate  identity  matrix.  The  preconditioned  system  still  has  the 
structure  allowing  us  to  estimate  its  spectrum  using  Lemma  5.1.  The  derivation 
of  the  general  form  of  our  first  preconditioner  is  motivated  by  the  assumption  that 
for  preconditioners  Py,  Pu  of  Hy,  Hu  and  for  an  approximate  inverse  A-1  of  A  the 
singular  values  of 

(6.5)  B  =  P^A^BP~t 

are  of  moderate  size.  If  Py  =  M4/2,  Pu  =  M^2,  and  A  =  A,  this  is  guaranteed  in  the 
situation  of  Section  2.1.  See  Lemma  2.1. 
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Lemma  6.1.  Let  B  £  Rni,><n“.  The  singular  values  cq  of  (Im\B)  are  given  by 

<Ji  =  yJl  +  o^B),  i=l,...,nv, 

where  Oi(B)  are  the  singular  values  of  B.  If  ny>  nu,  B  has  nu  singular  values,  and 
we  set  <Ji{B)  =  0  for  i  =  nu  +  1, . . .  ,ny. 

Proof.  The  proof  follows  immediately  from  the  fact  that  the  squares  of  the  singular 
values  of  a  matrix  B  are  the  eigenvalues  of  BBT .  □ 


In  the  situation  of  Section  2.1  the  estimate  in  Lemma  2.1  shows  that 

(6.6)  ofB)  <  \\M^2A~lBM~l/2\\  <  c,  *  =  1, . . .  ,  ny, 

for  a  constant  c  independent  of  h.  Thus  in  Case  1  ( Hy  =  My  and  Hu  =  Mu)  we  expect 
that,  for  preconditioners  Pu,  Py  and  A  neutralizing  the  dependency  of  Hv,  Hu  and  A 
on  the  mesh  constant  h,  we  can  similarly  bound  the  singular  values  of  Py A~1BPtt 
such  that 

(6-7)  <n(B)  <  \\Py  A~1BPfT\\  <  cP, 

where  cp  is  a  constant  independent  of  h. 

Assuming  that  (6.7)  is  valid  we  discuss  the  expected  performance  of  the  first  precon¬ 
ditioner  in  the  four  cases  defined  earlier.  By  <7®  =  af\Py  A~lBP~T),  l  =  1,2, 3, 4, 
we  denote  the  singular  values  of  Py  A_1PP“T  in  Case  l  =  1,2, 3, 4. 

Case  1  (7  =  1,  Dy  =  0,  Du  =  0):  If  7  =  1,  (6.7)  shows  that  there  exists  a  constant 
upper  bound  for  the  singular  values  o ^{Hy^A^BHf1^).  The  preconditioner  Pi  can 
be  expected  to  perform  well  if  the  preconditioning  matrices  Py,  Pu  and  A  neutralize 
the  influence  of  the  mesh  size  h  on  the  submatrices  and  thus  on  the  system,  and 
if  the  singular  values  of  Py  A^BPff7  are  bounded  by  a  small  constant  Cp.  If  the 

eigenvalues  of  P~lHyPyl  and  P~THuP~l  are  close  to  one  and  if  cr^jn  <C  1,  where 
crj1)  denote  the  singular  values  of  (PjA_1PP“T),  we  can  deduce 

^ny+nu  ~  1)  ^nj,+nu+l  ~  (l  —  j 

so  that  the  eigenvalues  of  the  preconditioned  system  are  bounded  away  from  zero. 
If  in  addition  cr$ax,  i.e.  the  constant  cp  in  (6.7)  is  of  moderate  size,  Lemma  5.1 
guarantees  that  the  condition  number  of  the  preconditioned  system  Pf1KPf  J  is 
small.  MINRES  and  SYMMLQ  will  perform  very  well  on  the  preconditioned  system. 
This  is  confirmed  by  our  numerical  tests.  See  Table  1. 

The  preconditioner  will  perform  poorly  if  the  singular  values  of  PjA“1PPu‘T  are 
not  small.  This  happens  in  two  of  the  remaining  three  cases. 

Case  2  (7  4C  1,  Dy  =  0,  Du  =  0):  If  a  small  parameter  7  determines  the  size 
of  the  eigenvalues  of  the  matrix  Mu,  we  must  expect  that  bounds  on  the  norm 
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||P1/2A-1PP“1/2||  grow  with  the  reciprocal  of  ^7.  Denoting  by  ct8(2)  the  singular 
values  of  we  have  the  relationship 


m(2)  = 


T0) 


V7 


For  decreasing  values  of  7  the  spectrum  of  Py A  1BPU  T  expands  and  the  conditioning 

of  the  preconditioned  system  deteriorates. 

Case  3  (7  =  1,  Dy  =  0,  Du  »  I):  In  this  case  Hu  =  7 Mu  +  Du ,  where  Du  »  /, 

i.e.  some  diagonal  entries  may  become  very  large.  Analogously  we  write  Pu  =  7 Po  + 

PD,  where  Pd  stands  for  the  (large)  diagonal  entries  and  Po  for  the  off-diagonal 

entries  that  are  generally  of  moderate  size.  By  we  denote  the  singular  values  of 

PT A-1  B P.~t  .  We  obtain  the  estimate 
y  u 

uf}  =  af\Py  A~l BP~t)  =  of\Py  . A~lB{piP0  +  PD)~T) 

=  af\P^A~lBPET{lPDlPo  +  /)“T) 

<  \\P^A~lB\\  \\PSt\\  117(^0  +  /)-!  <  \\PBT\\  1  _  ||7^p-fj|  • 


If  Du  dominates  the  matrix  Hu,  ||7F>oDD:r||  will  be  of  negligible  size.  If  additionally 
7  <  1,  this  contributes  to  reducing  the  factor  1/(1  —  ||7^>o-Pp1||)  to  a  constant 
close  to  one.  The  norm  ||PjA_15||  can  be  expected  to  be  of  moderate  size,  while 
HP"1 1|  will  be  very  small.  The  singular  values  it(3)  converge  to  zero  as  the  entries  in 
the  diagonal  Du,  and  with  it  in  PD,  grow.  In  the  case  of  large  diagonal  entries  in 
Hu  we  can  expect  a  good  performance  of  the  solvers  on  the  preconditioned  system, 
due  to  a  small  condition  number  of  P{lKP{T  which  is  in  turn  induced  by  small 
singular  values  of  PjA_1PP-T.  The  performance  of  MINRES  and  SYMMLQ  on  the 
preconditioned  system  is  documented  in  Table  2. 

Case  4  (7  =  1,  Dy  »  I,  Du  =  0):  If  we  denote  by  Py  the  preconditioner  for  Hy 
and  by  P0,  PD  its  off-diagonal  part  and  its  diagonal  part,  respectively,  then  we  see 
that  the  matrix  Py  A~lBPpT  will  have  very  large  singular  values.  This  is  indicated 
by  the  estimates  (M  =  A-1  B P^T Pff1  BT A~T) 

A max((Po  +  PD)TM{Po+PD))  >  ^ PD)  + \min(P£ M PD  + P^ M PD  + P%M PQ) 


and 

\min{{Po+PD)TM{Po  +  PD))<XTnin{P'EMPD  +  PlMPo+Pj)MPD)+\max{PoMPo)- 

For  the  estimates  see  [11,  p.  411].  While  the  preconditioner  yields  a  considerable 
improvement  over  the  unpreconditioned  system,  the  improvement  is  less  than  in 
Cases  1  and  3.  See  Table  3.  However,  the  improvement  is  expected  to  decrease  as 
the  diagonals  in  Dy  become  larger. 

6.2.  The  Second  Preconditioner.  We  have  seen  that  the  effectiveness  of  precondi¬ 
tioner  Pi  depends  on  the  size  of  the  singular  values  of  the  matrix  B  defined  in  (6.5). 
The  preconditioner  P2  is  designed  to  isolate  the  effect  of  B.  In  order  to  make  the 
action  of  the  second  preconditioner  transparent,  we  consider  the  ideal  version  of  P2, 
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denoted  by  P2*,  i.e.  we  choose  Pu  =  H]J2,  Py  =  H y2,  and  A  =  A.  For  the  general 
form  of  the  preconditioner,  which  is  used  in  the  computations,  we  refer  to  [3]. 

The  ideal  preconditioner  P2*  is  given  by  its  inverse  as 

/  tf“1/2  0  0  \ 

(p;rl  =  o  h-^  o 

V  -P“1/2  -Hyl2A~lBH~l  Hy2A~l 
The  ideal  preconditioned  system  is 

(In,  0  0 

p:-1kp;-t  =  o  o 

V  0  0  ~(Iny  +  BBT) 

where  B  is  defined  by  (6.5)  with  Pu  =  Hy2,  Py  =  Hy2,  and  A  =  A. 

The  application  of  the  preconditioner  P2  is  roughly  as  expensive  as  the  application 
of  the  preconditioner  P:.  The  performance  of  P2  is  slightly  inferior  to  the  performance 
of  Pi.  See  Tables  1-3.  The  eigenvalue  distribution  of  the  preconditioned  system, 
i.e.  the  eigenvalue  distribution  of  ( IHy  +  BBT),  can  be  analyzed  analogously  to  the 
previous  case. 


Table  1 

Iterations  of  MINRES  and  SYMMLQ  on  K  with  7  =  1,  Dy  —  Q,  Du  =  0. 


grid  size  dx 

5 

10 

15 

20 

25 

30 

dimension 

92 

282 

572 

962 

1452 

2042 

Without  Preconditioning 


MINRES 

47 

185 

431 

784 

1070 

1483 

SYMMLQ 

47 

179 

407 

647 

902 

1209 

Preconditioner  Pi 


MINRES 

23 

25 

24 

21 

: 

21 

19 

SYMMLQ 

23 

24 

22 

21 

19 

19 

Preconditioner  P2 


MINRES 

24 

35 

37 

37 

35 

35 

SYMMLQ 

24 

35 

36 

35 

35 

33 

Preconditioner  P3 


MINRES 

7 

6 

5 

5 

5 

4 

SYMMLQ 

7 

6 

5 

5 

5 

4 

6.3.  The  Third  Preconditioner.  A  third  preconditioner  is  derived  from  reductions 
performed  to  solve  QP  subproblems  in  sequential  quadratic  programming  methods, 
see  e.g.  [7].  As  before  we  use  the  ideal  form  for  the  presentation  of  the  preconditioner. 
The  general  form  of  the  preconditioner,  see  [3],  is  used  in  the  computations.  The  ideal 
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Table  2 

Iterations  of  MINRES  and  SYMMLQ  for  K  with  7=1  and  Du  =  104  ■  I,  Dy  =  0. 


grid  size  dx 

5 

10 

15 

20 

25 

30 

dimension 

92 

282 

572 

962 

1452 

2042 

Without  Preconditioning 


MINRES 

54 

173 

349 

589 

857 

1183 

SYMMLQ 

54 

173 

349 

579 

848 

1165 

Preconditioner  Pi 


MINRES 

16 

18 

18 

18 

18 

16 

SYMMLQ 

16 

18 

18 

18 

17 

16 

Preconditioner  P2 


MINRES 

21 

33 

35 

37 

35 

35 

SYMMLQ 

21 

33 

35 

35 

33 

33 

Preconditioner  P3 

MINRES 

5 

4 

4 

4 

4 

4 

SYMMLQ 

5 

4 

4 

4 

4 

4 

Table  3 

Iterations  of  MINRES  and  SYMMLQ  for  K  with  7  =  1  and  Dy  =  104  ■  I,  Du  =  0. 


grid  size  dx 

5 

10 

15 

20 

25 

30 

dimension 

92 

282 

572 

962 

1452 

2042 

Without  Preconditioning 

MINRES 

73 

282 

572 

962 

1452 

2042 

SYMMLQ 

73 

282 

572 

962 

1452 

2042 

Preconditioner  Pi 


MINRES 

50 

98 

194 

289 

449 

530 

SYMMLQ 

50 

98 

187 

283 

410 

524 

Preconditioner  P2 


MINRES 

61 

146 

235 

323 

453 

583 

SYMMLQ 

61 

143 

233 

323 

447 

547 

Preconditioner  P3 


MINRES 

44 

67 

120 

203 

275 

366 

SYMMLQ 

44 

56 

120 

167 

286 

355 

preconditioner  P3*,  given  by  its  inverse  as 


(P3)-1 


Iny  0  -1/2  HvA~l 

0  0  A~l 

—{A~lB)T  Inu  (A-'B)THyA~i 
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transforms  K  into  the  preconditioned  system 

/  0  In 

{p;)-1k{p*)-t  =  I  /„„  o 


\o 


where 


W  = 


-A~1B 


H  = 


0  WTHW 

Hy  0 
0  H„ 


The  matrix  IT  is  a  representation  for  the  nullspace  of  C  =  (A\B).  The  matrix 
WTHW  is  given  by 

WtHW  =  BTA~THyA~1B  +  HU  =  H1J2(btB  +  W2, 


where  B  is  defined  by  (6.5)  with  Pu  =  H^2,  Py  =  H y/2,  and  A  =  A.  Note  that  the 
partitioning  of  the  blocks  in  the  preconditioned  system  has  changed. 

The  preconditioner  P3  is  the  most  effective  in  reducing  the  number  of  iterations. 
See  Tables  1-3.  However,  the  application  of  the  general  preconditioner  P3  is  roughly 
twice  as  expensive  as  the  application  of  the  preconditioners  Pi  and  P2.  See  [3].  The 
eigenvalue  distribution  of  WTHW  can  be  analyzed  analogously  to  the  preconditioned 
system  with  Pi . 


7.  Conclusions 

In  this  paper  we  have  derived  preconditioners  for  matrices  K  arising  in  the  numeri¬ 
cal  solution  of  certain  distributed  linear  quadratic  control  problems  by  interior-point 
methods.  The  preconditioners  are  in  block  form,  with  blocks  composed  of  precon¬ 
ditioners  for  the  individual  blocks  of  the  matrix  K.  This  allows  the  incorporation 
of  known  preconditioners  for  the  governing  equations  of  the  original  problem  and  it 
allows  to  reuse  computationally  expensive  information  within  all  interior-point  iter¬ 
ations.  The  effectiveness  of  the  preconditioners  was  analyzed  using  the  properties 
of  the  control  problem  and  its  discretization,  the  block  structure  of  the  matrix  K, 
and  information  from  the  optimality  conditions.  Numerical  results  supporting  the 
theoretical  analysis  were  given. 
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Abstract.  We  investigate  augmented  Lagrangian  algorithms  to  solve  state  and  control 
constrained  optimal  control  problems.  We  augment  both  the  state-equation  and  the  non¬ 
smooth  state  and  control  constraints.  We  present  the  method  with  the  example  of  linear 
optimal  control  problem  with  a  boundary  control  function  but  the  proposed  algorithms  are 
general  and  can  be  adapted  to  a  much  wider  class  of  problems. 
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Key  words  and  phrases.  State  and  control  constrained  optimal  control  problems,  augmented 
Lagrangian,  elliptic  equations. 


1.  Setting  of  the  Problem 

Let  0  be  an  open,  bounded  subset  of  IR”  ,  n  <  3  ,  with  a  smooth  boundary  T.  We 
consider  the  following  optimal  control  problem: 

CP)  min  J(y,u)  =  -  [  (y  -  zd f  dx+^  [  ( u  -  udf  da 

Z  Jn  Z  Jr 


(i.i) 


Ay  =  /  in  fi  ,  y  —  u  on  T  , 


(1.2)  Ait/  e  K  ,  u  e  U  , 

where 

•  /,  2^6  L2(f2),  u,  ud  G  L2(r)  and  either  a  >  0  or  U  is  bounded  in  L2(T), 

•  L  is  a  finite  dimensional  (Hilbert)  space  and  Ai  e  £(W,  L ),  ( W  is  defined  just 
below). 

•  K  and  U  are  nonempty,  closed,  convex  subsets  of  L  and  L2(r)  respectively. 

•  A  is  an  elliptic  operator  defined  by: 

'  n 

Ay  =  -  X)  dXi{aij(x)dXjy)  +  a0(x)y  with 
i,j= 1 

•  aij,a o  G  C2(fl)  for  i,j  =  1,...  ,n,  inf  (ao(a:)  |  x  G  fl}  >  0 
E  atj(x)^  >  , Vx  e  fi, G  IR", 8  >  0 . 

ij= 1  2=1 


(1.3) 
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System  (1.1)  is  well-posed:  for  every  (u,f)  G  L2(T)  x  L2(Ll)  there  exists  a  unique 
solution  y  =  T(u,  f)  in  W,  where 

W  =  {  y  G  L2(Q)  |  Ay  G  L2(Ll)  ,  y]r  G  -L2(r)  }. 

Moreover  T  is  continuous  from  L2(F)  x  L2(Q)  to  W,  when  W  is  endowed  with  the 
graph  norm: 

\y\2w  =  \y\l  +  \Ay\l  +  \yw\l. 

From  now  on,  when  H  is  an  Hilbert-space,  we  denote  by  (  ,  )H  (resp.  (  ,  )n  and 
(  ,  )r)  the  H  (resp.  L2( Q)  and  L2(r))  inner  products  and  by  |  \H,  |  |n,  |  |r,  the 
H,  L2( tt)  and  L2(F)-norms,  respectively.  Moreover,  we  define  A  :  W  x  L2(T)  — > 
L  x  L2(T)  by  A (y,v)  =  (A \(y),v)  and  we  assume  that  the  feasible  domain 

V  =  {  (y,u)  G  W  x  L2(r)  |  Ay  =  /  in  Q,  y  =  u  on  F,  A (y,  u)  G  K  x  U  }  , 

is  nonempty.  It  is  easy  to  see  that  problem  {V  )  has  a  unique  solution  ( y ,  u)  since  the 
functional  J  is  strictly  convex  and  coercive  and  V  is  convex,  closed  and  nonempty. 
Our  main  purpose  is  to  present  new  augmented  Lagrangian  algorithms  to  solve  nu¬ 
merically  optimal  control  problems  of  the  above  type.  Usually  such  algorithms  use 
the  augmentation  of  the  “smooth”  part  of  the  constraints,  that  is  the  state-equation. 
This  has  been  done  in  Fortin-Glowinski  [3]  and  adapted  to  the  present  example  in 
Bergounioux  [1].  Here  we  use  a  different  point  of  view,  since  we  use  a  Lagrangian  func¬ 
tion  where  both  the  state-equation  and  the  nonsmooth  constraints  “A xy  G  K,  u  G  U” 
are  augmented.  These  last  constraints  are  augmented  using  a  method  developed  in 
Ito  and  Kunisch  [4]. 


2.  Optimality  Conditions 

In  this  section  we  recall  a  result  which  is  crucial  to  interpret  the  forthcoming 
algorithms  and  to  give  convergence  results. 

Theorem  2.1.  Let  (y,  u)  be  the  optimal  solution  of  (V)  and  assume  the  following 
qualification  condition 

There  exists  a  bounded  (in  L2(Ll)  x  L2{T))  subset  Ad  of  W  x  L2(T)  such 
.  .  that  A{M)  C  K  x  Uand  0  €  Int2(V(M))  , 

'  '  where  Int2  denotes  the  interior  with  respect  to  the  L2(tt)  x  L2(T) -topology 

and  V(y,  u)  =  (Ay  -  /,  y\T  -  u). 

Then  there  exists  ( q,f )  G  L2(Ll)  x  L2(T)  and  (pi,p,2)  G  A1(IU)  x  L2(T)  such  that: 

(2.1)  Ay  =  f  in  LI  ,  y  =  u  onT  , 

(2.2)  (y~zd,  y)n  +  (q,  Ay)n  +  (f,  y)T  +  (fti,  Aij/)x  =  0  for  all  y  eW  , 

(2.3)  a  (u-ud)  =f-/x2  €  L2(r)  , 


(2.4) 


(fix,  Kx(y  —  y))L  <  0  for  all  y  such  that  Ai y  G  K 
(p>2 ,  u  —  u)r  <  0  for  allu  G  f/  . 
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Proof.  This  is  a  particular  case  of  a  more  general  result  that  can  be  found  in  [2],  ■ 

As  a  specific  example,  L  can  be  chosen  as  the  set  of  linear  finite  elements  with 
respect  to  a  triangulation  of  and  Ai  :W—*L  can  be  the  L2-projection. 

3.  Lagrangian  Algorithms 

In  this  section  we  turn  to  the  numerical  realization  of  the  constrained  optimal 
control  problem  (V).  We  shall  combine  the  techniques  from  [1]  and  [4]  augmenting 
the  state  equation  as  well  as  the  constraints  characterizing  the  feasible  set  D,  to 
obtain  well  performing  algorithms. 

3.1.  Augmentation  of  the  State  Equation.  First  we  recall  an  augmented  Lagrangian 
algorithm  based  on  the  penalization  of  the  state  equation  (see  [3] ,  [1]  and  the  refer¬ 
ences  therein). 

Algorithm  A0 

•  Step  1.  Initialization:  Set  n  =  0,  and  choose  7  >  0,  qa  G  L2(Q),  r„  G  L2(T). 

•  Step  2.  Compute 

(: Vn ,  Un)  =  Arg  min  Ly(y,  v,  qn>  rn) 

A  {y,u)GKxU 

where 

ry  /y 

L7(y,  u,  q,  r)  =  J(y ,  u )  +  {q,  Ay)n  +  {r,y  -  u)T  +  - \Ay  -  f\2n  +  - \y  -  u|r 

is  the  augmented  Lagrangian  with  respect  to  the  state  equation  constraint. 

•  Step  3.  Set 

qn+ 1  =Qn  +  Pi  {Ay„  -  /)  where  p1  G  (0, 2q  ]  , 
r„+ 1  =rn  +  p2  {yn\r  -  un)  where  p2  G  (0, 2q  ]  . 

The  analysis  of  this  algorithm  is  rather  standard,  see  [1]  and  the  references  there.  For 
the  convenience  of  the  reader  we  provide  a  precise  convergence  result  (which  appears 
to  slightly  generalize  the  existing  ones)  and  give  a  concise  proof. 

Theorem  3.1.  Let  ( y ,  u )  be  the  solution  to  (V)  and  suppose  that  (H)  holds.  Then  the 
iterates  of  Algorithm  A0  satisfy 

(3.1)  | yn  -  y\2n  +  a\un  -  u\2r  +  ^k+i  -  q\l  +  “  flr 

+  (7  -  y)l Ayn  -  f\2n  +  (7  -  j)\Vn  -  un\2r  <  ~\qn  -  q\2n  +  -~\rn  -  f|^ 
for  all  n  =  0, 1, 2, . . .  .  This  implies 

OO  OO  OO 

(3-2)  J2  \Vn  -y\h  +  aY,  K  -  “lr  +  (7  -  y I  AVn  -  f  In 

n=0  n= 0  n=0 

n  00  1  1 

+  (7  -  ~)  E  I  Vn  ~  Un\l  <  190  -  q\l  +  X-ko  -  r\l  , 

2  „=0  lPl  ZP2 

and  in  particular  strong  convergence  of  ()/„,«„)  — >  (y,  u)  in  L2(fl)  x  L2(r),  and 
boundedness  of  {(qn,rn)}.  If  moreover  pi  <  27  and  p2  <  27  then  ( yn,un )  — >  (y,  u) 
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inW  x  L2(Y),  and  every  weak  limit  (q,  f)  of  (qn,  rn)  has  the  property  that  (y,  u,  q,  f) 
satisfies  (3.3). 


Proof.  Prom  Theorem  2.1  we  obtain 

(3.3)  (j'(y,  u) ,  (y,  u)  -  (y, «))  (q,  A(y  -  y)j  +(r,y  -  y  -  (u  -  «))  p>  ° 

for  all  A (y,u)  G  K  xU.  The  solutions  (yn,un)  of  Step  2  are  characterized  by 

(j'(yn,  Un),  (y,  u)  -  (yn,  u«))  nxr  +  ($*+!>  A(V  ~ 

(3.4)  +  (rn+1)  y-yn-(u-  un)j  ^  +  (7  -  Pi)  {^Vn  ~  f,A(y- 

+{l  -  p2)(yn-un,y -yn-  (u-un)^  >0 

for  all  A (y,u)  eKxU.  Adding  (3.3)  with  (y,  u)  =  {y„,un)  to  (3.4)  with  (y,u)  =  (y,u) 
one  obtains 

(3.5)  (j'{yn,  un)  -  J'(y,  ft),  (y„,  un)  -  (y,  u)j  ^  +  (qn+i  -q,Ayn-  Ay'j  ^ 

+  (r„+i  —  f,yn  —  un) r  +  (7  -  Pi)|Ay„  -  f\h  +  (7  -  P2)|y«  -  un |r  <  0. 
Let  us  note  the  following  equality 

(3.6)  (a  +  pb,  b)fi  =  ~^\a  +  P^?h  ~  + 

for  all  elements  a,b  of  a  real  Hilbert  space  H  and  all  p  G  IR. 

Due  to  (3.6)  we  find 

(qn+ 1  -  q,A(yn  -y))n  =  T^kn+i  ~o\q  -  “ffln  +  yl"4^  -  ^In 

and 

(rn+ 1  —  f,yn  —  =  -~H»»+i  -f|r  -  ~^f\Tn  ~  flr  +  ylP™  ~  un\r  ■ 

Inserting  these  equalities  into  (3.5)  we  obtain 


\Vn  -  y\n  +  ol\ un  -  u\\  +  —  \<ln+i  -  q\n  +  \rn+i  ~  r\r 
+y  | Ayn  -  /|n  +  y  | y„  -  un\l  +  (7  -  Pi)|Ay„  -  / In  +  (7  -  p2)\yn  -  un |r 


and  (3.1)  follows.  Using  a  telescoping  argument  (3.2)  is  implied  by  (3.1).  The  ascer¬ 
tained  convergence  properties  follow  from  (3.2),  (3.4)  and  uniqueness  of  ( y,u,q,f ). 
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3.2.  Augmentation  of  the  Non  Smooth  Constraints.  The  main  remaining  problem 
is  the  resolution  of  the  auxiliary  problem  of  Step  2  in  Algorithm  A0 •  This  auxiliary 
problem  can  be  written  as: 

{yn,un)  =  Argmin  L7{y,u) 

A (y,u)  €  D  . 

To  simplify  the  notation  we  omit  to  indicate  the  dependence  of  L7  on  q  and  r.  During 
Step  2  these  functions  are  fixed.  We  set  H  =  L  x  L2(r).  Let  <p  be  the  characteristic 
function  of  the  convex  set  D.  Then,  following  [4],  we  define  (for  any  c  >  0)  the 
function  <pc :  H  x  H  — >  IR  by: 

(3.7)  ipc{x,  A)  =  inf{  <p(x  -£)  +  (A,f)ff  +  }  , 

where  x  =  ( y,u ).  Here  (  ,  )H  (denoting  the  H- inner  product)  is  given  by  = 

(^i>£i)n  +  (-^2i^2)r  j  with  A  =  (Aj,  A2)  and  £  =  (£1,62)- 

We  recall  some  properties  of  the  function  tpc  (for  more  details  one  can  refer  to  [4,  2]): 
Proposition  3.1.  For  all  x  =  (y,u)  £  H  and  A  =  (Ai,  A2)  £  H 

A)  =  %  -  Pk(v  +  ^-)\l+ (Xi,y  -  PK{y  + -j)) 

{3.8)  2  \  \ 

+  C-\u  -  Pu(u  +  -f)\2r  +  (A 2,u  -  Pv{u  +  ^))r  , 

(3.9)  )  =  c(y  +  ^- -PK(y+^-),u+>^--Pv(u  +  ^)j  , 

where  PK  (resp.  Pv,  PD)istheL  (resp.  L2(T),  H )  projection  on  K  (resp.  onU ,  D). 

Proof.  See  [2].  ■ 

We  are  going  to  use  the  following  algorithm  and  a  splitting  variant  to  solve  the 
auxiliary  problem: 


Algorithm  A\ 

•  Step  1.  Initialization:  Choose  A°  £  H  and  c  >  0. 

•  Step  2.  Compute 

(: yj ,  Uj )  =  Arg  min  L7(y,  u )  +  ^c(A(j/,  u),  X3) 

A(y,u)  eW  x  i2(T)  , 

where  cpc  has  been  defined  in  the  previous  section. 

•  Step  3.  Set 

\i+1  =  <p'c{K{y?  ,u3),X3)  =  c(Aiy3  +  —  -Pk(AiJ/j+— ),uj-\ — --Pu{wl+-^)\ 

V  c  c  c  c  / 

(see  3.9). 

The  convergence  of  this  algorithm  under  the  assumption  that  L  is  finite  dimensional 
follows  from  result  in  [4]. 
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3.3.  Final  Algorithm.  We  now  write  the  version  where  Algorithm  A\  appears  as  an 
inner  loop  in  algorithm  A0: 

Algorithm  A 

•  Step  1.  Initialization:  Set  n  =  0,  and  choose  7  >  0,  c  >  0. 

Choose  (q0,r0)  £  L2(Q)  x  L2(T)  and  A0  =  (A0i,Ao2)  £  Lx  L2(T). 

•  Step  2.  Choose  kn  £  IN,  set  A°  =  An  ,  and  for  j  =  0, . . .  ,  kn: 

(yi,u{)  =  Arg  min  £7(t/,  u,  qn,  rn)  +  <pc(A(y,  u),  XJn) 

(y,u)  £  W  x  L\Y)  , 

K+1  =  (K+iXfl)  with 

* 

AS'  =  clA.Ki  +  Al-P^A^+Al)], 

-  4<  +  -  Pu(<  +  %]  • 


End  of  the  inner  loop: 


Aji+1  ;  Vn  Vn  i  tln  Un  . 


Step  3. 


qn+ 1  =  qn  +  Pi  (tt^TT  £  Ayn  ~  f)  where  ft  6  (°> ]  > 

K"n  ~ t  j— 0 

rn+i  =  rn  +  p2  (7 rwr  5Z (2/^.lr  -  <))  P2  e  (0, 27  ]  . 

“T  J-  j_o 

Theorem  3.2.  Let  (y,  u)  be  the  solution  to  (V)  and  suppose  that  (H)  holds.  Let 
(q,  r,  p)  £  L2(Yl)  x  L2(r)  x  L  x  L2(r)  be  an  associated  Lagrange  multiplier.  Then  the 
iterates  of  Algorithm  A  satisfy 

I yn  -  y\h  +  a\un  -  u\y  +  kn+i  -  q\u  +  ■  2p2  "lr«+1  ~  flr 

(3.10)  +(7  -  y)| Ayn  -  /In  +  (7  -  y)lun  -  2/nlr  +  ,^1'Wi  -  L\lxL2(r) 

<  ^  "I^W  —  ^ _lr”  _  +  2c^n  ~~  ^lix£2(r) 

/or  all  n  =  0, 1, 2, . . .  .  Ifknis  nonincreasing  this  implies 

OO  OO 

IZ  ll/n  -  tfln  +  “  E  K  -  «lr 


_  00  n 

+(7  -  ~)  £  1^™  -  /In  +  (7  -  y )  £  K  -  S**lr 

Z  n=0  Z  n=0 

<  °+  I  go  -  ?ln  +  2  ko  -  r|p  +  —|A0  -  M||Xi2(r) 


(3.11) 
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and  the  strong  convergence  of  (yn,un)  — >  (y,  u)  in  L2(£l)  x  L2(T),  and  boundedness 
of  {(qn,rn,  An)}-  If  moreover  pi  <  27  and  p2  <  2j  then  ( yn,un )  ->  ( y,u )  in  W  x 
L2( F)  and  every  weak  limit  ( q ,  r ,  A)  of{(qn,  r„,  An)}  has  the  property  that  (y,  u,  q,  r,  A) 
satisfies  (2.2),  (2.3). 


Proof.  From  (2.2),  (2.3)  it  follows  that 


(3.12 )  fj'(y,u),(y,u))  +  (q,  Ay)n  +  (r,y  -  u)r  +  (p,  A(y,u))  =0 

V  /nxr  V  /  LxL2(r) 

for  all  (y,u)  e  W  x  L2(F).  The  solutions  (y3n,u3n)  of  Step  2  satisfy 

(j'(yL<),(y,u))  +  (qn+-^—{Ay3n-f),Ay) 

\  /nxr  V  k„  + 1  /n 

(3.13)  +(7  ~Y^)(AV}n-f,Ay)a+  (r"  +  kf+l^  ~  U^,V  ~  U)r 

+(l-T^~r)(yi-ui,y-u)r+U'c{A(yiui),\i),A{y,u)]  =0 

kn  + 1  V  /  lxl2(  r) 


for  all  ( y,u )  €  IF  x  L2(r).  Let  us  denote  qnl  qn,  rnl  :=  rn  and  for  j  = 
0, . . .  ,  kn,  n  E  IN 


ql  =  Qn  + 


Pi 

kn  +  1 


E(^n  “  /).  r{  =  rn  + 


i= 0 


P2 

kn  T  1 


From  now,  for  convenience,  we  omit  the  indication  of  the  norm  since  there  is  no 
possible  confusion.  From  [4]  it  follows  that  p  =  ip'c(A(y,u),p).  From  (3.12)  and 
(3.13)  we  deduce  for  j  =  0, 1, . . . 


J'ivi,  K)  -  J'(y,  u),  (( y3n ,  <)  -  ( y ,  u)j  +  (q3n1  +  Y~—fiAyi  -/)-«.  Avi  -  f 

+  (7"~1  +  aT+T^”  <)  +  (7  -  -  /I2 


+  (7- 


P2 


kn  +  1 


)\yi 


12 . 


pi 


kn  + 1  i=0 


YXAyi-f^Ayi-f) 


-  <>  vi.  -  K) 


kn  +  lf^O 

+  {Pc(A(yi,ui),\i)-<p,c{A{y,u),fi),A(yi,uil)-A(y,u))  =0  , 


-1 

where  :=  0.  From  [4]  it  is  known  that 
i= o 


(3.14) 


p'c(A(yJn,ui),K)  -  p'c(A(y,u),p),A{yi,ui) 


■  A  (y,  u 


>i-\K+i-p\2 


2c 
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for  j  =  0, 1, . . .  ,  kn.  Inserting  this  inequality  in  the  above  equality  and  using  (3.6) 
we  obtain 


i  a  - 1? + «k  -  »i! + - 1  -  -  # 


2  ,  Kn  f  J- 1  j  _  — 12  _  +  1  |^J-1  _  y|2 


2f>2 


j-X 


+‘7 ~  - <'2 -  fcf+l  §<*  -  l’ M  ~  f) 

E«  -  <•  <  -  <)  +  ^(K+1  -  -  W  -  pi2)  <  o 


kn  +  1  i=o 

for  j  =  0,1,...  ,  kn.  Summing  the  above  inequality  over  j  and  using  the  fact  that 

kn  3  1  JU  kn 

EEK^<yE(«;) 

J=1  2—0  jf=0 


we  arrive  at 

kn 

I 

3=0 


E(l#j  -  51s  +  o|<  -  61s)  +  -  Si2  +  ^\rkn  -  f  1 2 


+(7 -4)E IV„  -  /|!  +  (7  -  y)EK  -  <\2  +  ~«-+1  -  Ml2 


3=  0 

+  1  \n  _  _j_  fcn  +  1  , 

2/^2 


Since  (*/„,«„)  =  (i£*,ti£0,  (g„+i,rn+1)  =  (g£",r£")  and  27  >  pi,  27  >  p2  this  implies 


\Vn  -  p|2  +  a|u„  -  u|2  +  ^\qn+ 1  -  <?|2  +  ^k„+1 


Mi  +  1 1 


2pi 


2p2 


■fl2 


+(7  -  f  )\Ayn  -  fl2  +  (7  -  yfe,  -  ft.12  +  2)i7»:.i  -  Pi2 


<^k-?l2  +  ^h-<f  +  ^|v-Pi2, 


1 

2c1 


which  is  the  desired  estimate  (3.10).  Summation  over  n  implies  (3.2).  This  ends  the 
proof.  ■ 


3.4.  Adding  Gauss-Seidel  Spitting.  Our  final  goal  is  the  analysis  of  Gauss-Seidel 
splitting  techniques  to  solve  the  auxiliary  problems.  A  similar  approach  was  taken  in 
[1] .  The  splitting  avoids  the  minimization  of  the  auxiliary  problem  with  respect  to  y 
and  u  simultaneously.  The  new  algorithm  is: 
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Algorithm  A„s 

•  Step  1.  Initialization:  Set  n  =  0,  choose  7  >  0,  q0  £  L2(£l),  rQ  £  L2(T), 
U— 1  G.  U. 

•  Step  2. 

yn  =  Arg  min  Ly(y,  un_  1,  rn) 

A  iV&K 


Step  3. 


un  =  Arg  min  Ly(yn ,  u,  r„) 

u£U 

qn+ 1  =  Qn  +  Pi  (Ay„  -  /)  where  px  £  (0, 27]  , 
r„+ 1  =rn  +  p2  ( yn\r  -  un)  where  p2  £  (0, 7]  . 


Theorem  3.3.  Under  the  assumptions  of  Theorem  3.1  the  iterates  (yn,un,qn,rn)  of 
Algorithm  A„s  satisfy 


\Vn  -  y\a  +  «l«n  -  u|r  +  -  «ln  +  K+i  _  rlr 


+  (7  -  y)l Ayn  -  f\l  +  (7  -  y)K  -  Ifelr 
+  (a  +  y)| un  -  un-ilr  +  -  «lr 


/>2> 


(3.15) 


<  -  <?l«  +  ^lr™  _  flr  +  (^y^)K-i  -  ?/n-i|r+|K-i  -  «lr 

for  all  n  =  0, 1, 2, . . .  .  This  implies 


E  ll/n  -  I/In  +  «  L  K  ~  wlr  +  (7  -  y )  Z)  “  /I 


n=0 


n=0 


n=0 


(3.16)  X]  lu«  _  2/nlr  +  (a  +  y )  —  w«-i  lr 

Z  n=0  Z  n=0 


P2x 


<  _L|0  _^|2  ,  J_,r  _f|2  ,  (7-P2) 

-2Pl'9°  9ln+2p2|0  lr  + 


/y 

2  ,«-i-W-ilr+  glu-i-ulr  - 


Proof.  The  optimality  conditions  for  the  two  auxiliary  problems  of  Step  2  give 

(jy(yn,un-i),y-yn')  +  (qn,  A(y  -  ySj ^  +  (rn,y-yn^ 

+7 (^Ayn  -  /,  A(y  -  yn)^  +  i(yn  -  un_  1,  y  -  ynJ^  >  0  for  all  Axy  £  K  , 


'in)i 


and 

( JuiVni  ^n)>  ^ 

(3.17) 

\  /  r  \ 

( 

-l[yn-Un,U 
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Adding  these  two  inequalities  and  combining  with  (3.3)  we  obtain 

(  J'iVn,  Un)  -  J'(y,  u),  (yn,  Un)  -  (y,  u)  ) 

V  /  nxr 

(3.18)  +  (qn+i  -  q,  A(yn  -  y))^  +  (rn+i  -r,y„- 

+(7  -  PiMVn  -  /In  +  (7  -  P2)\yn  -  un\r  +  l(un-i  ~un,y-  yn^  <  0  . 

This  expression  coincides  with  (3.5)  except  for  the  last  term  on  the  left  hand  side  of 

(3.18) .  Proceeding  as  in  the  proof  of  Theorem  4.1  we  therefore  obtain 

I Vn  -  y|n  +  a | un  -  u\l  +  — 1<7„+ i  -  q\h  +  ^lr«+i  ~  ^lr 

(3.19)  +(7  -  y )\Ayn  -  f\2n  +  (7  -  y  )\yn  ~  «n|r  +  T^n-l  -Un,y-  Vn)r 
<2^k-^+2^K-r^. 

The  method  of  estimation  of  ftt„_ j  —  un,y  —  y„)  is  standard  [3],  but  is  given  for 
the  sake  of  completeness.  First  we  note  that 


(3.20) 


(^n— 1  J/n)  (^n  un-l  >  Vn  1/n— l) 

+  (^n  ttn~l 


yn- 1  -  y-  un- 1+  u 


+  ~ ( I un  —  1t|p  —  —  u|p  —  \un  tin_i|r)  . 


Using  the  optimality  condition  (3.17)  for  n  and  n  —  1,  with  u  =  un  and  u  =  un_i 
respectively  and  adding  the  two  resulting  inequalities  one  arrives  at 

-a\un-un-i\l+(rn-rn-i,un-un-1^-'y(yn-yn-i-(un-Un-i),un-i-un^  >  0  , 

which  in  turn  implies 


(3.21)  'y^Vn  Vn—  lj^ra  ^n— 


^  (7  l|p  p2  ^n—  liVn—  1  ^n— 1 

for  n  =  1, 2, _ Inserting  (3.21)  into  (3.20)  implies  that 

7  (u«-i  —  un,  V  ~  3/«)r  ^  (a  +  ^)|w„-w„-i|r 

+{l~P2)(u 


i,  ’Un-hUn-l  Un—  1 


+  Z{\Un-u\l~  |«n-!-«lr) 
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f) 2  '"V  -  1 2  /  \  I  1 2 

~  2  l^n—  1  ^n—  1  |r  H  ^"Jj 'U'n  ^n— i|r 

(3.22)  +^(K  -  «lr  -  K-i  -  «|r)  . 

where  p2  <  7  is  used.  Inserting  (3.22)  into  (3.19)  the  desired  inequality  (3.15)  follows. 
The  second  inequality  in  the  claim  of  the  theorem  follows  by  a  telescoping  argument 
from  the  first  one.  ■ 

Once  again,  we  may  use  algorithm  A\  to  solve  the  first  sub-problem  of  Step  2.  The 
second  one  is  easily  solved  directly,  see  Remark  3.1  below.  For  convenience  we  shall 
henceforth  delete  the  index  1  in  the  notation  of  the  state  component  of  the  multiplier. 

Algorithm  AGS 

•  Step  1.  Initialization:  Set  n  =  0  and  choose  7  >  0,  c  >  0. 

Choose  (&, r0)  e  L  x  L2(F),  A„  e  L2(Q)  and  «_a  e  L2(r)  . 

•  Step  2.  Choose  kn  £  IN,  set  \°  =  \n,  u~l  =  1  and  for  j  =  0, . . .  ,  kn 

'  y{  =  Arg  min  i7(y,  r„)  +  (fiMiv,  <-1),  (\Jn,  0)) 

yew 

A 2+1  =  c[A1yi  +  ^-PK(A1yi  +  ^)}, 

K  =  Arg  min  L^(y{,u,qn,rn) 
ueU 

End  of  the  inner  loop:  An+i  =  A£n+1,  yn  =  y*n  ,un  =  u*n. 

•  Step  3. 

kn 

Qn+ 1  =qn  +  T- TT  I2(Ayi  -  /)>  where  Pi  6  (0, 2q]  , 

"t"  t  j_Q 

Pi  ^ n 

rn+i  =rn  +  t~7  Y,(yi\r  ~  <)>  where  P2  e  (0, 7]  • 

"'to  1  J-  j— q 


Remark  3.1.  The  second  minimization  problem  is  indeed  equivalent  to 

aud  +  rn  +  I 


Arg  min 
ueU  , 


u  — 


cc  +  7 


,  .  i  ■  j.u  7"2/T'\  .  .  •  r  Ck'UcJ  -j-  rn  T  7f/n  rr 

that  is  u3n  is  the  L  ( 1 ) -projection  0}  - - — -  on  U . 

We  may  now  end  this  section  with  a  convergence  result  for  Algorithm  AG5. 


Theorem  3.4.  Let  (y,  u)  be  the  solution  to  (V)  and  suppose  that  (H)  holds.  Let 
(q,r>p)  €E  L2(Ll)  x  L2(r)  x  L  x  L2(r)  be  a  Lagrange  multiplier  associated  to  the 
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state  equation  and  the  state  constraint.  Then  the  iterates  (yn,un,qn,rn)  of  Algorithm 
Ags  satisfy 

\yn  -  l/|n  +  (a  +  ^)K  -  u\l+^^—\qn+i  -  q\2n+  \rn+i  -  Hr 

+  (7  -  ^-)\AVn  -  f\n  +  7  9  P2\Un  -  Vn\r  +  ^“|An+l  _  kll 

(3.23)  2  k  4-  1  2  k  +  1  n 

-  2P2  |r-  r|r 
4-— |An  —  4  — —  |wn_i  —  y»-i|r  4“  ~^\un-\  ~  w|r 


for  all  n  =  1,2, ...  .  If  kn  is  nonincreasing  this  implies 


OO  / 

5Z  (\y*-y\ 

n=  1  V 


n  4-  a\ un  -  u\l  +  (7  -  y  )|Ayn  -  f\2n  4-  un 


-  2/n|r)  — 


"'2pi 1  lgi  _  +  ^2^2 1  ^  ~  +  2c^Al  _  +  7  2 P2  ~~  +  2  ' 

Proof .  We  combine  the  techniques  used  in  the  proofs  of  Theorems  3.2  and  3.3  which 
allows  to  omit  some  details.  Once  again  we  use  the  optimality  conditions  issued  from 
Step  2  of  Algorithm  AGS  .  The  iterates  (j/£,.<)  of  Step  2  satisfy,  for  j  =  0, . . . ,  kn, 
for  all  y  €  W 


(j'y{yi,K  l),y))^  +  (qn  +  ^(Ayi  -  /),  Ay)  ^ 
(3.24)  +(7  -  k  P]~f){Ay3n  -  /.  Ay)n  4-  (r„  4-  {vL  ~  u3nl)i  v)T 

+{l  -  -  un-1’ y) r  +  (^c,i(Ai2/n>  A* ),  aw))l  =  °> 


and  for  all  u  €  U 


(j'uiyl  Ui)’U  (r™  +  k  Un)’U 


Relation  (3.12)  implies 


(V(y,  u),  (y,  u  -  ui)  4-  (<?,  Aj/)n 

+  (f,j/-(u-u))r4-  (/bAw)  >0 

\  /  LxL2(T) 


(3.26) 
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for  all  ( y,u )  &W  xU.  Combining  (3.24)-(3.26)  and  (3.14)  of  the  proof  of  Theorem 
3.2  implies 

-|2  ,  „|..7  -|2  ,  +  1 ,  ,.l  =12  kn  +  1 -|2 


li£  -  2/In  +  -  «lr  + 


2pi 


2pi 


\<fn  1  -«|f 


+  (7 
+  (7 


Pi  M  A~.i  t\2  ,  fc«  +  l,  ,2  fcn  +  1  i  j— 1  -|2 

lrn-rlr  r  n  Mr 


P2 


2(kn  +  1) 

P2 

2(kn  + 1) 

i-i 


)\Ay3n  —  /In  + 


)lwj-«nlr 


2p2 

Pi 

kn  + 1  f 


2p2 


l-i 


E(Ai/n  -  /» Ayi  -  fh 


i= 0 


,  ^  7  E(ld  -  <-  »£  -  <)r  +  ^(|Ai+1  -  /4  -  |A{  -  /i|!) 

Kn  T  1  j=Q  ZC 


+7(<-<  \y3n-y)r<° > 

for  n,  j  =  0, 1, . . .  ,  fcn.  Summing  the  above  inequality  over  j  we  arrive  at 


;|2\  |  kn  +  l  I  fe 


E(k£  -  y\lM<  -  «lr)+  9 

1=0  ZP! 

Pi. 


kn"  -  Q\h  + 


kn  +  1 

2p2 


rn"  ~  i'lr  +  ^IAn+l  -  pin 


< 


+(7  -  f-)  E  -  /ln+(7  -  y)E  llfii  -  <lr+7E  (<  -  <  \vi  ~  V) 

1  i= o  z  l=o  l=o  v  '  r 

Now  we  estimate  -  u }n~1,  y3n  -  yj  as  in  the  proof  of  Theorem  3.3.  We  obtain, 

for  j  =  1,2,...  and  n  =  0, 1, . . . 

7 (<_1  -  <,  P  -  2/£)r  >  -  <_1lr  -  JOl/T1  -  <_1  lr  +  l<_1  -  Glr  -  K  -  «lr)  • 

A  similar  calculus  provides  the  estimation  of  (u°n  —  u~l,  y°n  —  yj  for  n  =  1, 2, . . . 


(3.27) 


7l iv  >  («  +  y)K-w1lr  +  ^rIly™-i ~  v1!* 


1 12  ,  P2 _ 7  I 

2 


+^K-«lr-^K1-«lr  • 


We  henceforth  assume  n  >  1.  We  obtain 


fc, 

5 

1=0 


7  E(un~<  1>y3n-y)  >(Q  +  ^)K~Hn-l|f+P29  7kn-l-Mn-llr 


Kn 

+  ?K  -  «lr  “  ?lu™-i  “  “lr  +  «  E  l<  “  <_1!r 


i=i 


-  |  E(kn  1  -  <  1|r  +  l<  1  -  «lr  -  l<  -  «lr) 

zl=i 


(3.28) 
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We  finally  get  for  kn  >  1 


Ed^n  -  V\l  +  a\<  -  Mlr)  +  1  -  ?ln  +  ^ 

i=o 


2pi 


2p2 


:K+i  -rlr 


f|A„+1-/i|i  +  (7-^)El^n-/ln  + 

ZC  Z  j=0 


kn 


1~P2 


fl\yi-<\r  +  Ik 


j= 0 


< 


kn  +  1 


l?n  -  <?|fi  + 


kn  + 1 1 


^|2 


2pi 

+  2^1^™  _  P\l  + 


2p2 
7-P2 


ry 

^  |l/n  —  Un-l  |q  +  ^  j^n— 1  —  ^|r  • 


Since  p2  <  7  we  deduce  that  if  kn  >  1 

kn  + 1 1 


^n  T  f  1 


■  . 9  l  _ 1 9  /Vyj,  I  -L  I  _ 1 9  'A'Tl  I  J-  I  — 12 

ll/n  —  2/|n  +  oi\un  —  u|r  H — — — kn+i  —  ?ln  H  ^7  lrn+i  _  rlr 


+  (7  _  ^)\Ayn  ~  /In  + 


2pi 

2  ,  7-P2 


2p2 


/y 

2 - kn  —  “nlr  +  T^l-Wl  —  +  2  ~  “lr 


u\ 2 


“I-  1 1  _|2  1  fan  ^  I  — 12  .  ^  I  \  - 12 

1  -  «l»  +  ^r'r"  -  r'r  +  S|A>  -  <‘l" 

+  ^ ^  I2/0 — 1  —  wn-ilr  +  ^|un-i  —  ^Ir  • 

Using  (3.27)  the  same  estimate  follows  for  kn  =  0.  The  final  claim  again  follows  with 
a  telescoping  argument.  ■ 


4.  Numerical  Experiments 


4.1.  Implementation.  Numerical  experiments  were  carried  out  for  one  and  two  di¬ 
mensional  problems  but  we  present  only  a  lD-example.  Since  Algorithm  AGS  is  the 
simplest  for  implementation  we  have  used  it  for  our  tests.  The  discretization  of  the 
problem  was  done  with  finite-differences  discretization  schemes.  The  size  of  the  grid 

was  so  that  L  =  IR^"1"1.  A  was  chosen  as  the  discretization  operator  with  respect 
to  the  given  equidistant  grid. 

The  main  difficulty  that  remains  in  applying  Algorithm  AGS  is  given  by  the  (un¬ 
constrained)  minimization  with  respect  to  y.  This  was  done  via  the  adjoint  state 
equation  and  results,  for  fixed  u ,  q  and  r  in  the  resolution  of 


(4.1) 


A*p  =  y  -  zd  +  c  [y  +  -  -  PK{y  +  -)]  in  ft,  p  =0  on  T 
Ay  =  f-!±Z  =  +  on  T 


7 


7  7  dvA- 


for  p  and  y.  Here 


dp 

dvA, 


denotes  the  conormal  derivative  of  p  with  respect  of  A* 


(which  is  the  adjoint  operator  of  A).  The  control  function  was  computed  using  the 

too  ■  f  r  +  aud  +  72/ 

L-pro  lection  of -  on  U . 

a  +  7 
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All  numerical  tests  were  carried  out  on  an  HP  workstation  using  the  MATLAB© 
package.  The  required  accuracy  and  stopping  criteria  were  set  to  10-6. 

4.2.  lD-Example.  In  this  example  we  chose  Q  =]0, 1[  and  N  =  30,  A  =  —A  and 
f(x)  =  -{x  +  2)  exp(x),  zd  =  -1,  a  =  0.1,  ud(0)  =  -2,  nd(l)  =  1,  U  =  [-3, 3]  and 
K  =  {  Y  g  L  |  -  1.1  <Y  <  1  }. 

Note  that  zd  is  quite  close  to  the  boundary  of  K. 

In  fact,  as  can  be  seen  from  Figure  1,  the  lower  bound  on  the  state  is  active.  The 
active  set  is  a  singleton.  In  view  of  the  fact  that  the  influence  of  the  boundary 
control  at  x  =  0  and  x  =  1  is  restricted  to  the  superposition  of  straight  lines  to  the 
uncontrolled  state,  this  is  not  surprising. 


Solution  for  ud=[-2,1]  and  zd=-1 


Figure  1 


c 

7 

kn  (constant) 

II  Ay  +  f\\oo 

l|y-ull°o 

n 

CPU  units 

min[y-(-l.l)] 

10 

1 

10 

4.8  e-07 

4.  e-07 

58 

1 

4.  e-10 

10 

1 

1 

9.3  e-07 

6.  e-07 

154 

2.17 

2.5  e-06 

10 

1 

100 

2.2  e-07 

5.  e-07 

13 

1.35 

-2  e-09 

100 

10 

10 

6.2  e-07 

9.  e-07 

95 

1.01 

-1.3  e-11 

Table  1 


One  of  the  main  questions  concerning  the  class  of  algorithms  that  we  analyzed  is 
the  choice  of  the  parameters  p; ,  c  and  7.  From  Table  1  we  conclude  that  while  the 
choice  of  the  parameters  certainly  has  an  influence  on  the  convergence  properties  of 
the  algorithm,  there  is  a  wide  range  of  parameters  values  for  which  convergence  is 
achieved  numerically,  for  this  and  other  examples  that  we  tested.  In  all  calculations 

7  1 

we  chose  p;  =  1.  Some  tests  shows  that  the  ratio  -  =  —  is  a  good  one.  For 

c  10 

(c, 7)  =  (1,1),  (c, 7)  =  (100,50),  (0,7)  =  (1,0.5)  (all  with  kn  =  10  for  all  n), 
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convergence  is  achieved  but  it  is  slower  than  for  those  pairs  that  are  presented  in 
Table  1.  Prom  that  Table,  as  well  as  from  other  tests,  it  can  also  be  seen  that 
the  auxiliary  problem  should  be  solved  sufficiently  accurately,  before  the  Lagrange- 
multipliers  (q,  r)  for  the  state  equation  and  the  boundary  condition  are  updated  (see 
kn  =  1).  The  values  (c,  7)  =  (10,0.1)  still  with  pt  =  1  led  to  divergence.  This  is  not 
unexpected  in  view  of  the  result  of  Theorem  4.5  which  requires  p2  <  7. 

The  numerical  values  for  J  and  the  control  at  the  minimum  are: 

J  =  1.5862  lfT1  and  u(0)  =  -9.9573  10-1,  u(l)  =  2.6314  10-2  . 

4.3.  Conclusion.  The  augmented  Lagrangian  algorithms  with  splitting  into  state  and 
control  variable  can  effectively  be  used  to  solve  state  and  control  constrained  opti¬ 
mization  problems.  For  the  first  order  methods  that  are  presented  in  this  paper  the 
auxiliary  problems  in  the  inner-loop  must  be  solved  sufficiently  accurately  before  the 
Lagrange-multipliers  of  state  equation  and  boundary  condition  are  updated.  Ap¬ 
propriate  choices  for  the  penalty  parameters  (here  c  and  7)  and  the  step  lengths  pi 
for  the  dual  variables  are  easily  determined  since  the  algorithm  are  not  particularly 
sensitive  to  them. 
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Abstract.  We  study  an  approximation  of  the  boundary  control  problem  for  the  heat  equa¬ 
tion  over  a  finite  horizon.  Our  goal  is  to  obtain  an  approximation  of  the  value  function  and  of 
the  corresponding  “locally  optimal”  trajectories.  We  examine  here  a  time  discretization  also 
proving  some  a  priori  estimates  of  convergence  for  the  value  function  of  the  time-discrete 
problem.  Some  hints  are  also  given  for  the  construction  of  a  fully  discrete  scheme. 
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1.  Introduction 

We  deal  with  the  finite  horizon  control  problem  for  a  system  governed  by  the  heat 
equation  focusing  our  attention  on  its  numerical  approximation.  In  particular,  we 
will  examine  in  detail  the  case  of  boundary  controls.  We  refer  to  Lions  [17]  for  the 
theoretical  framework  of  the  continuos  problem. 

It  is  well  known  that  via  the  Pontryagin’s  maximum  principle,  one  can  characterize 
the  open-loop  solution  in  terms  of  a  coupled  system  involving  the  state  y(u)  and 
the  adjoint  state  p(u)  ( u  is  the  control).  In  this  formulation  the  adjoint  state  satis¬ 
fies  a  backward  heat  equation  in  [0,T]  having  y(x,T\u )  as  initial  condition  so  that 
the  numerical  solution  of  the  coupled  system  requires  a  long  iterative  process.  This 
approach  has  been  followed  by  Hackbush  ([13],  [14])  who  has  applied  the  multigrid 
technique  to  obtain  the  solution  of  the  system  describing  the  necessary  conditions  for 
optimality.  Note  that  the  approximation  of  the  system  of  necessary  conditions  re¬ 
quires  a  good  initial  guess  for  the  multipliers  (the  adjoint  states)  to  start  the  iterative 
procedure  and  to  guarantee  a  local  convergence  to  the  solution.  Searching  for  the 
initial  guess  is  often  one  of  the  more  difficult  tasks  in  that  approximation  method. 
Having  in  mind  those  limitations,  we  consider  here  a  different  approach  where  Dy¬ 
namic  Programming  plays  a  role  also  in  the  construction  of  the  approximation  scheme. 
The  main  goal  is  to  reduce  the  global  complexity  of  the  algorithm  and  to  avoid  the 
cumbersome  solution  of  a  Hamilton- Jacobi  type  equation  in  the  whole  space  of  initial 
data  (which  in  our  case  would  require  the  solution  of  a  nonlinear  PDE  in  L2(D)). 

This  work  has  been  partially  supported  by  the  Ministry  for  University  and  Scientific  Research 
(MURST  Project  “Analisi  Numerica  e  Matematica  Computazionale”). 
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Our  method  is  based  on  a  backward-forward  procedure  where  we  try  to  compute  a 
trajectory  minimizing  the  pay-off  in  a  neighbourhood  of  a  reference  trajectory  (which 
is  assumed  to  be  given).  Prom  this  point  of  view  this  paper  is  close  to  the  numerical 
experiments  presented  for  finite  dimensional  control  problems  in  [1]. 

We  will  establish  convergence  for  a  time-discrete  scheme  and  we  will  also  give  some 
hints  for  the  construction  of  the  fully  discrete  algorithm  based  on  a  space  discretiza¬ 
tion  (by  finite  elements  or  finite  differences).  A  similar  approach  has  been  followed 
by  Ferretti  for  distributed  and  boundary  control  problems  ([8],  [10])  without  state 
constraints  and  localization  around  a  trajectory.  Among  the  huge  amount  of  liter¬ 
ature  on  the  control  of  parabolic  type  equations  we  quote  the  paper  [2]  where  the 
approximation  of  the  linear  regulator  problem  is  considered  and  [16]  where  abstract 
convergence  results  for  approximation  methods  have  been  given  mainly  for  the  linear 
quadratic  problem  (see  also  the  reference  therein  and  [15]).  More  recently  Banks 
and  Ito  [3]  have  studied  the  numerical  aproximation  of  boundary  control  problems 
by  means  of  augmented  lagrangian  techniques. 

The  paper  is  organized  as  follows.  In  Section  2  we  set  up  the  problem,  introduce  our 
notations  and  prove  some  useful  properties  of  the  value  function  for  the  continuous 
control  problem.  Those  properties  are  rather  important  for  the  construction  of  the 
discretization  scheme  and  also  play  a  role  in  the  proof  of  the  dynamic  programming 
principle  which  is  given  in  the  Appendix.  The  time  discretization  is  treated  in  Section 
3  where  we  prove  a  discrete  Dynamic  Programming  Principle  which  will  be  used  to 
obtain  the  approximation  scheme  and  to  prove  our  convergence  result.  Section  4  is 
devoted  to  the  presentation  of  the  basic  ideas  for  the  numerical  algorithm. 


2.  Some  preliminary  results  for  the  continuous  problem 

Let  fi  be  un  open  subset  of  Mn,  P  =  dfl  and  to,  T  be  two  real  numbers  such  that 
T  >  t0.  We  set  Q  =  D  x  (t0,  T)  and  S  =  r  x  (f0,  T). 

We  will  say  that  P  is  regular  if  it  is  piecewise  C 1  in  the  sense  given  by  the  definition 
of  [18,  p.  25], 

Let  E  be  an  Hilbert  space,  we  will  use  the  following  spaces 

fC 

L2(t0, T;  E)  =  {h  :  [to,  T\  — >  E such  that  /  ||  h(s)  |||  ds  <  +oo} 


W(t0,  T)  =  {h  G  L2(t0,  T;  E)  such  that  —  €  L2(t0,  T;  E')} 

where  the  derivative  dh/dt  should  be  understood  in  the  distributional  sense.  We  will 
consider  a  boundary  control  problem  with  Neumann  condition  for  the  heat  equation: 

{  §j.y{x,  t)  -  Aj/(a:,  t)  =  g(x,  t)  in  Q 

£y{x,t)  =  u(x,t)  onE 

y(x,  t0)  =  t)(x)  in  Q 


(2.1) 


< 
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where  77  is  given  in  L2(£l),  g  £  L2(t0,T-  H\Q)),  and  the  controls  u  belong  to  U  = 
L2{ £),  i.e.  we  start  investigating  the  problem  without  constraints.  Later  on  in  this 
section  we  will  also  introduce  some  constraints  on  the  control  space. 

We  will  always  consider  solutions  of  (2.1)  in  the  weak  sense.  We  will  denote  that 
solution  by  yn(x,t-,u)  and  when  it  will  be  possible  without  ambiguities  we  will  also 
use  the  short  notation  yv(t\u )  or  yv(t). 

In  order  to  have  existence  and  uniqueness  for  the  solution  of  (2.1)  for  every  fixed 
u  £U  one  can  apply  the  standard  results  for  the  heat  equation  (see  f.e.  Theorem  1.2 
p.102  in  [17]).  The  solution  depends  continuously  on  the  data  and  lives  in  W(to,T). 
Sometimes  we  will  refer  to  L2(Q)  as  the  space  of  observations,  i.e.  the  space  where 
the  solution  lives  for  every  fixed  t  in  [t0,T]. 

In  order  to  simplify  our  notations,  let  us  define 

|[  u(t)  ||2=  J  |  u(x,t)  |2  dx  , 

||  y(t\u)  ||2=  J  |  yv(x,t;u )  |2  dx  . 

The  final  time  T  being  fixed,  for  every  initial  condition  ( rj ,  t0)  and  for  any  control  u 
we  define  the  pay-off: 

(2.2)  J(r),  t0,  u)=  (  u ),  u(t))  e~x(t~to)dt  +  ip{yv(T;  u)),  e_A(T_to) 

JtQ 

where  /  :  L2(Q)  x  L2(r)  —>  R,  ip  :  L2(f!)  — >  R  and  A  >  0.  Note  that  the  discount 
factor  e~x^t~to'>  appearing  in  the  functional  (2.2)  is  usually  included  to  up-date  all 
the  cost  at  the  initial  time  (when  the  decision  has  to  be  made)  and  to  decrease  the 
costs  over  long  time  intervals. 

We  will  make  the  following  assumptions: 

(a)  there  exists  a  constant  Bf  such  that 


(2.3) 

\f(y,u)\  <  Bf,  for  any  (y,u)  £  L2(Q)  x  L2(r)  , 

(2.4) 

/  is  continuous  with  respect  to  (y,u); 

(b)  there  exists 

u  £  L2(r), 

a  constant  Cf  >  0  such  that  for  any  z,w  £  L2(Q.)  and  for  any 

(2.5) 

| f(z,u)  -  f(w,u) |  <Cf  ||  z-w  ||, 

(2.6) 

/  is  convex  with  respect  to  (y,  u); 

(c)  there  exists  a  constant  such  that 

(2.7)  \tp(y)\  <  for  any  y  £  L2(fl); 

(d)  there  exists  a  constant  >  0  such  that  for  any  z,w  £  L2(fl) 

(2.8)  ^(2:)  -  ip(w)\  <  ||  2  -  w  ||, 

(2.9)  ip  is  convex. 
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We  want  to  minimize  J,  i.e.  we  want  to  find  an  optimal  control  u  G  U  such  that 

(2.10)  J(r),t0,u)  =  inf J(g,t0,v). 

The  value  function  for  our  problem  is  defined  as 

(2.11)  v(rj,t0)  =  inf J{rj,t0,u). 

u€.U 

Definition  2.1.  The  system  (2.1 )  is  said  to  be  controllable  if  for  u  £  U,  the  observa¬ 
tion  y(T ;  u)  spans  a  dense  subspace  in  the  space  of  observations. 

For  the  proof  that  our  system  is  controllable  when  U  =  L2(E)  see  [17,  p.  207].  It 
is  clear  that,  in  general,  the  existence  of  an  optimal  control  attaining  the  minimum 
of  J  is  not  guaranteed.  In  order  to  prove  it  we  need  some  additional  assumptions. 

Theorem  2.2.  Let  (2.3),  (2.4),  (2.5),  (2.6),  (2.7),  (2.8),  (2.9)  be  satisfied.  Let 
Uad  =  {u  e  U  :\\  u  ||i2(S)<  i?}  and  assume  that  f(y,-)  is  continuous  for  any  y  G 
L2(D). 

Then,  there  exists  a  control  u  G  Uad  such  that 

(2.12)  J{g,t0,u)  =  inf  J(g,t0,v). 

v(=Uad 

Proof.  Let  us  define  cp  :  Uad  -*  R  as  cp{u )  =  J(g,t0,u).  In  order  to  get  the  result 
we  will  apply  Corollary  III. 20  in  [6]  (for  readers  convenience  its  exact  statement  has 
been  included  in  the  Appendix)  so  let  us  check  that  all  the  assumptions  are  satisfied. 
It  is  easy  to  see  that  Uad  is  a  closed,  bounded,  convex  subset  of  the  reflexive  Banach 
space  L2( £).  By  (2.3)  and  (2.7)  we  have  that  f  ^  +oo. 

To  prove  the  convexity  and  the  continuity  of  f  we  first  observe  that  for  any  a  G  (0, 1) 
and  for  any  u,  v  G  Uad, 

(2.13)  yv{x,t\au  +  (1  -  a)v)  =  ayv(x,t;u)  +  (1  -  a)yv(x,t\v). 

In  fact,  we  can  write  yv(x,t-,au  +  (1  —  a)v )  =  a(x,t)  +  bau+(i-a)v(x,t)  where  a(x,t) 
is  solution  of  (2.1)  for  u  =  0,  i.e. 

'  fty(x,t)  -  Ay(x,t)  =  g{x,t)  in  Q 

(2.14)  ■  £y(x,t)=0  onS 

y(x,  t0)  =  y(x)  in 

and  bau+(i-a)v(x,  t )  is  solution  of  (2.1)  for  g  =  0  and  g  =  0  with  the  control  au+  (1  — 
a)v,  i.e. 

f  y(x,  t)  -  Ay{x,  t)  =  0  in  Q 

(2.15)  <  ^ y{x,t )  =  au  +  (1  —  a)v  on  £ 

y(x,t0)  =  0  infi. 

By  linearity  of  the  equation  we  have  that  6a„+( i-a)v(x,  t )  =  abu(x,  t)+(l—a)bv(x,  t ) 
and  this  give  us  (2.13). 
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By  the  convexity  of  ip  and  /  and  by  (2.13)  we  can  easily  prove  that  <j>  is  convex. 
Let  us  prove  that  (p  is  lower  semicontinuous. 

Let  un  E  Uad  be  a  sequence  converging  to  u  in  L2(E).  By  the  weak  formulation  of 
the  heat  equation  and  exploiting  the  linearity  of  the  dynamics  we  get  the  following 
estimate 


(2.16)  ||  Vv(t]  U)  -yr,{t-,Un)  ||<||  U-Un  ||i2(S)  . 

The  continuity  of  ip(yr](T-,u))erx(T~to'1  with  respect  to  u  then  follows  by  (2.8)  and 


(2.16). 

Let  us  examine  the  integral  term  appearing  in  the  definition  of  <j>. 

Jto 

=  [  u),  u(t))  -  f{yn(t ;  0),  u(t))]e-x{t-to)dt  + 

Jto 

+  f  [. f(v v(t;  0 ),u(t))  -  f{yv{t ;  0),  0 )]e_A(t“*o)dt  + 

Jto 

[  f{yv{t\0),0)}e-x(t-to)dt  . 

Jto 


(2.17) 


+ 


By  (2.16)  and  the  Lipschitz  continuity  of  we  get 

/  \f(Vr,(t-,  u),  u(t))  -  f(yv(t ;  0),  u(f))]e-A(*~to)df  <Cf\\u  ||La(E)  . 

Jto 

Note  that  the  continuity  assumption  on  f(y,  ■)  implies,  by  the  Fatou’s  Lemma,  that 

I(u)  =  f  f{yr,{t]v),-)erx{t^to)dt 

Jto 

is  a  lower  semicontinuous  function  over  Uad-  Then,  we  can  conclude  that  <f>  is  lower 
semicontinuous  over  JJad  since  the  last  term  in  (2.17)  is  constant  with  respect  to  u. 
The  proof  can  be  completed  simply  applying  Corollary  III. 20  in  [6].  □ 


Assuming  that  /  and  ip  are  strictly  convex,  we  can  conclude  that  the  optimal 
control  is  unique.  Note  that  hypothesis  (2.6)  is  fullfilled  when  the  running  cost  /  has 
the  form  f(y,u)  =  fi{y)  +  /z(u)  with  /i  and  f2  convex. 

The  Lipschitz  continuity  of  the  cost  function  with  respect  to  the  initial  data  is 
established  in  the  next  theorem. 


Theorem  2.3.  Let  Q,  be  a  bounded  open  set  with  a  regular  boundary  and  let  the  as¬ 
sumptions  (2.5)  and  (2.8)  be  satisfied.  Then,  there  exists  a  constant  C  >  0  such  that 
for  any  u  in  U 

(2.18)  \j(j],t0,u)  —  J(fj,t0,u)\  <  C  ||  r?  —  n  ||  ,  V7?,/x  e  i2(fi). 

Proof.  We  can  write  yv(x,tm,u)  =  av(x,t )  +  b(x,t,u)  where  a^xfi)  is  the  solution  of 
(2.1)  for  g  =  0  and  u  =  0,  and  b(x,t,u )  is  solution  of  (2.1)  for  rj  =  0. 

The  map  av(x,t )  =  A(t)rj(x)  is  linear  in  rj  by  linearity  of  the  equation.  Theorem 
7.2-2  p.161  in  [18]  gives  for  g  =  0,  the  continuous  dependence  of  the  solution  with 
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respect  to  the  initial  data.  We  can  conclude  that  it  exists  a  constant  Ca  >  0  such 
that 

(2.19)  ||  A(t)r)(x)  ||<  Ca  ||  7]  II  ,  Vt  G  [ t0,T }  . 

Let  us  denote  by  yv{x,  t;  u )  the  solution  of  (2.1)  and  by  y^x,  t\  u )  the  solution  of  the 
same  system  with  initial  data  p(x). 

The  linearity  of  A  and  the  inequality  (2.19)  imply 

(2.20)  \\yv(tiu) -y^u)  \\=\\  A(t)(r)  -  (x)  \\<Ca\\r]  -  v\\  ,  Vte[t0,T\. 
Recalling  the  definition  of  the  pay-off,  by  (2.5)  and  (2.8)  we  get 

|  J(r),t0,u)  -  J(fj,,t0,u)  |  <  /  \f(yv(t;u),u(t))  -  f(yll(t-,u),u(t))\e~x(-t^to)dt  + 

Jto 

+  myv(T))  -Hyp(T))\e-x{T-to)  < 

<  f  Cf  II  yv(t)  -  y^t)  II  dt  +  C \p  ||  yv(T)  -  y^T)  ||  . 

J  to 

By  (2.20),  we  can  conclude  that  there  exists  a  constant  independent  from  u,  C  = 
CfCaT  +  C-^Ca,  such  that 

(2.21)  |  J(ri,t0,u)  -  J(n,t0,u)  |<  C  ||  b  -  ft  ||  . 

□ 

As  an  easy  corollary  we  get  the  Lipschitz  continuity  of  the  value  function  with  respect 
to  the  initial  data. 

Corollary  2.4.  In  the  same  hypotheses  of  Lemma  2.3  there  exists  C  >  0  such  that  for 
any  rj,n  €  L2(fl)  we  have 

(2.22)  \v(r},t0)  -  v(n,t0)\  <  C  ||  rj  -  p  ||  . 

We  state  now  the  Dynamic  Programming  Principle  which  will  give  a  characteriza¬ 
tion  of  the  value  function  also  useful  for  numerical  purposes  (see  the  Appendix  for 
the  proof). 

Theorem  2.5.  Let  the  value  function  be  defined  as  in  (2.11).  Then 

(2.23)  v(r),t0)  =  inf  {  [  f(yv{t]u),u(t))e-x{t~to)dt  +  u(^(r),  r)e“A(T-to)} 

uEU  {'Jto  ' 

for  every  t,  to  <  r  <  T. 

It  is  well  known  that  the  Dynamic  Programming  Principle  gives  a  characterization 
of  the  value  function  where  the  value  at  the  initial  condition  v(r],to)  depends  on 
its  value  at  the  point  y^r)  belonging  to  the  optimal  trajectory.  This  is  the  basic 
principle  of  optimality  for  the  trajectories.  This  principle  is  also  useful  to  derive  the 
Hamilton- Jacobi-Bellman  equation  giving  a  characterization  of  v  in  terms  of  a  partial 
differential  equation.  In  what  follows  we  will  use  a  discrete  version  of  this  principle 
(Theorem  3.1)  to  get  a  semi-discrete  approximation  scheme  for  the  value  function. 
Moreover,  (2.23)  and  its  discrete  version  will  play  an  important  role  when  proving 
the  convergence  of  that  scheme  to  the  value  function  of  our  problem. 
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For  the  numerical  approximation,  we  are  interested  to  a  particular  choice  of  the 
cost  function  (2.2).  Note  that  an  explicit  dependence  on  t  can  be  easily  included  in 
the  running  cost.  We  will  take  then 

(2.24)  f(t,  yv,  u)  =||  u(t)  || l  +  ||  yr,{t-  u )  -  £(f)  ||2 

where  £  is  a  given  function  from  L2(Q)  in  R  and 

(2-25)  iW))  =11  Vv(T)  ~  zT  ||2 

where  zT  is  a  given  function  from  L2(Cl)  in  R. 

Moreover,  we  also  want  to  restrict  the  space  of  admissible  controls  and  of  the  initial 
conditions  assuming  that 

(2.26)  u  e  Uad  =  {u  eU  such  that  ||  u(-,t)  11^2 (r)<  R  for  any  t  G  [to,?1]}, 

(2.27)  y  £  K  =  {n  €  L°°(Q)  such  that  ||  p  ||oo<  M}. 

Note  that  the  restriction  on  the  controls  which  appear  replacing  U  by  Uad  may 
imply  the  loss  of  the  controllability  property  for  our  dynamics.  However,  under  the 
assumptions  of  Theorem  2.2  there  exists  a  minimum  for  the  pay-off.  In  practice,  we 
can  think  that  £(t)  is  a  trajectory  starting  at  rj  and  reaching  a  neighbourhood  of  our 
terminal  state  z t  obtained  by  an  analysis  of  the  controllability  problem  or  by  some 
experiments.  It  is  what  we  know  about  the  problem  before  starting  the  optimization 
process.  We  want  to  minimize  the  pay-off  given  by 

[T  II  «(i)  112  +  II  yn(t-  u)  -  at)  II2  e-W-^dt+  II  yv(T)  -  ||2 

Jto 

Note  that  the  term 

[  II  Vvfa  u )  -  C(t)  II2  e~A(‘_to)cff 

Jto 

has  been  added  in  order  to  penalize  the  L2-distance  from  £  in  the  interval  [to,  T}.  We 
will  come  back  to  this  point  in  the  last  section,  where  we  will  explain  the  algorithm 
giving  more  details. 

In  the  following  Lemma  we  will  show  that  under  some  restrictive  hypoteses  we  can 
apply  Theorem  2.3  to  the  pay-off  J  corresponding  to  (2.24),  (2.25)  so  that  our  value 
function  is  Lipschitz  continuous  with  respect  to  the  initial  data. 

Lemma  2.6.  Assume  that  Q  is  a  bounded  open  set  with  regular  boundary,  and  that 
(2.26)  and  (2.27)  hold  true.  Then  (2.5)  and  (2.8)  are  satisfied. 

Proof.  Let  us  start  from  the  weak  formulation  of  the  heat  equation.  After  some 
calculations  one  can  obtain  the  following  estimate 

y{t;u)  ||2<||  rj  ||2  +(1  /a)  ||  g  ||i3(0,T;«-i)  +  II  u  IIl*(e) 


(2.28) 
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for  all  t  in  [t0,T\  (a  >  0).  Then,  (2.26)  and  (2.27)  imply 

(2.29)  ||  y(P,  u)  ||<  K  ,  for  all  t  e  [to,  T]  , 

where  K  =  ( M2meas{Cl )  +  (1/a)  ||  g  111 2(o,t-,hi)  +-R2)1/2- 
Fix  g  and  y  in  L2(fi),  t  in  [t0,T]  and  u  in  Uad.  By  the  definition  (2.24)  we  get 

f(yv(t;u),u(t))  -  f(Vr(t-,u),u(t))  =||  yv(t-,u)  -  C  (t)  ||2  -  ||  %.(*;«)  -  CW  f< 

^11  3/ij(^i u)  ~~  y^i u)  II  {  II  %(L u)  ~~  u)  II  T2  ||  yfi{t,  u)  —  ||  |. 

By  (2.29)  we  have 

f(yv(t ; «),  u(t))  -  /(yM(t;  u),u(t))  <  [2 K  +  2(K+  ||  C  II)]  II  %(*; «)  -  u)  II  • 

Inverting  the  roles  of  y  and  g,  after  some  calculations  we  get 

\f{yv{t-,u),u(t))  -  f{y^t-,u),u(t))\  <  Cf  II  Vvfa  u)  ~  u)  II 

where  Cf  =  [2 K  +  2 (K+  ||  C  II)]-  This  proves  (2.5). 

The  proof  of  (2.8)  is  similar  so  we  skip  the  details.  □ 

By  Theorem  2.2,  one  can  easily  see  that  under  the  same  hypoteses  of  Lemma  2.6  we 
have  existence  and  uniqueness  of  the  optimal  control  for  the  particular  cost  functional 
defined  by  (2.25)  and  (2.24).  Note  that  the  proof  of  the  the  Dynamic  Programming 
Principle  is  still  valid  also  when  we  assume  (2.26)  and  (2.27)  (see  Appendix). 

3.  Semi-discretization  and  convergence 

Let  us  introduce  the  discretization  in  time  of  our  problem.  For  the  sake  of  simplicity 
we  will  consider  only  the  case  when  the  time-step  is  constant,  but  the  results  can 
be  extended  to  a  variable  time-step  using  standard  arguments.  Given  IV  e  N  we  set 
A t=[T  -  t0]/N  and  t„  =  to  +  nAt  for  n  =  0, . . . ,  N. 

Let  the  set  of  controls  U  be  replaced  by  the  set  of  admissible  discrete  controls 
U  C  U.  The  set  U  will  play  an  important  role  in  our  discretization.  Just  to  fix  ideas, 
one  can  imagine  that  U  is  some  sort  of  finite  representation  of  U  more  suitable  for  the 
construction  of  the  algorithm  (we  will  come  back  to  this  point  in  the  next  section) . 

We  replace  our  dynamics  (2.1)  by  a  discrete  time  dynamics  obtained,  for  example, 
by  an  explicit  Euler  scheme 

'  y{ x,  tn+i)  =  y{x,  tn)  +  At  [A y{x,  tn)  +  g(x,  tn)} 

(3.1)  ■  ^p?/(-D  Li)  u(x,tn) 

.  y{x,t0)  =  g{x) . 

The  solution  will  be  denoted  by  yv(x,  tn;  u)  and,  whenever  is  possible,  we  will  use  the 
short  notations  yv(tn;  u )  and  yv{tn),  for  every  n  =  0, . . . ,  N  and  for  every  u  €  U. 
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For  any  n0  6  {0, . . . ,  N  -  1}  we  define  the  pay-off  corresponding  to  the  discrete 
time  dynamics  (3.1)  as 

(3.2)  J(r/,  tno,u)  =  At  £  fWv{tn),u{tn))e-x^-to)  +  ip(yv(T-,  u))e~W-**>) 

n=no 

and  for  n0  =  N  we  define, 

(3.3)  J(rj,tN)  =  ip{r]). 

The  corresponding  value  function  is 

(3.4)  v(rj,tno)  =  inf  J(7?,f„o,u),  n0. e  {O', ...,1V}. 

ueu 

We  will  construct  our  scheme  by  means  of  the  following  discrete  version  of  the  dyna¬ 
mic  programming  principle. 

Theorem  3.1.  Let  the  value  function  be  defined  by  (3 A).  Then , 

(3.5)  v(V,  tno)  =  inf  | At  £  /(%(<„),  u(tn))e-A(t’*-t”o)+ff(^(tp),ip)e-A^-4"o) 

U€U  I  71=710 

for  every  integer  p,  no  <  p  <  N. 

Proof.  Let  us  denote  by  w(r),  tno)  the  right-hand  side  of  (3.5).. 

We  consider  two  cases. 

1)  Let  p  =  N.  We  have 

(3.6)  w(r),tno)  =  inf  l At  £  /(^(t„),u(4))e-A(‘"-to)-f-v(^(T),T)e-A(T-t" ° 

UEU  {  71=710 

By  the  definitions  (3.4)  and  (3.2),  we  have 

v(yv(T),T)  =  iniJ(yv(T),T;u)  =  inf  ^(yp(T))  , 

uEU  u£JJ 

so  that  (3.6)  coincides  with  the  definition  of  value  function. 

2)  Let  p  <  N .  We  divide  the  proof  into  two  parts. 
a)  v(v>tno)  >  w{rj,tno). 

Let  us  fix  a  control  u,  by  (3.2)  we  have 


^  p- i 

J(V,tn o,«)  =  A /(%(*n),«(irl))e_A(t’1_t"o)  + 

71=710 

+  A /(yr/(in),w(4))e“A(T“t"o)  +^(^(T))e-A(T-t’*o)  . 

71— p 
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Defining  p  =  yv(tp;u),  the  uniqueness  of  the  solution  of  (3.1)  implies  yv(x,  tn\  u) 
Vp[x,tn\u)  for  every  tn  >  tp.  Then, 


J{rh^notu)  —  At  ^  /(^(in))  u(tn))e 


"A  (tn  ) 


+  A fESW^ye^'^’+^^Je  A(T  *"«)  = 


=  A tJ2  /(%(in),«(4))e_A(<"^o)  +  T(^;ip,w)e  x^  *"<>)  > 
n=no 

>  At  J2  /(%(^),«(<n))e"A(t"_tno)  +  v{^tp)e-^t”-tno) 


by  definition  (3.2)  and  (3.4).  Taking  the  infimum  over  U  we  have 


v{v,tno)  >  inf  At  £  +  Q(n,tp)e  Afe  t"°) 

ueU  I  n=no 


b)  v(y,tn J  <  w(rj,tno). 

Fi xu  eU  let  y  =  yn(tp ; it).  For  any  fixed  e  >  0  there  exists  a  control  uE  such  that 


v(n,tp)  +  e>  J{fi,tp,ue)  . 


Let  us  define  the  control 


u(x,tn)  = 


u{x,tn)  tno  <tn<  tp-i 
w(*D^n)  tp  ^  tn  T  . 


We  first  note  that  the  uniqueness  of  the  solution  of  (3.1)  implies 


l rr  t  ■  Ti\  —  I  yv(X>  tni  tno  —  tn  <  tp- 1 

yn[x,  tn,  u)  <  ^  tn.  tp  <tn  <T  . 


Recalling  the  definitions  u,  (3.2),  (3.8)  and  the  inequality  (3.7)  we  have 


v(y,tno)  <  J(y,tno,u)  =  At  ^  f(yv(t-,u)Mtn))e~xitn-t^  + 


+At  £  f(yv(t-,u),u{tn))e-x(-u  *"<>)  +  j;(yv(T))e 


-\(T  tn,Q  ) 


A.  Briani  and  M.  Falcone 


59 


=  At  J2  f{yv{t',u),u{tn))e  A(t"  tno)  + 

n=n o 

+A t  J2  f(%(t\ue),ue(tn))e~Ktn-tn<>)  +  ^(y^(T))e_A(T_t"o)  = 

n=p 

P-1  ^ 

=  A tJ2  f(yv(t-,u),u(tn))e-x{tn~tn o>  +  J(M,  ipi  < 

71=710 

p-i 

<  A t  £  /(j?7)(i;u),{t(in))e-A(t"-t"o)  +  (0(//,fp)  +  e)  e~A(t*-^. 

71=720 

For  e  tending  to  0  and  taking  the  infimum  over  u  £  U  we  end  the  proof.  □ 

Let  us  turn  now  to  the  proof  of  convergence.  The  basic  idea  is  simple:  coupling  an 
Euler  discretization  scheme  for  the  dynamics  with  a  quadrature  formula  (rectangles) 
for  the  cost  we  can  get  a  reasonable  approximation  of  the  value  function.  Two  main 
questions  have  to  be  clarified.  Which  conditions  on  the  two  discretization  schemes 
guarantee  the  convergence  to  the  value  function  of  our  approximation  scheme  ?  How 
accurate  is  that  discretization  scheme  ? 

To  obtain  results  in  both  directions  we  make  the  following  assumptions: 

(HI)  For  any  rj  £  K,  u  £  Uad,  0  <  At  <  T  -  t0  and  £  £  [h,T]  there  exist  u  £  U 
and  two  positive  constants  C\  and  C2,  such  that 

(3.9)  ||  %(£,«)  -yv(i,u)  ||<  Ci(At)2 

(3.10)  I A  t /(%(£), u(£))e~A(?_to)  -  f  f(yv(t),u(t))e~x{t~to)dd\<C2(At)2 

(H2)  For  any  y  £  K,  u  £  U,  0  <  At  <  T  — 10  and  £  £  [ t0,T  -  At],  there  exist 
u  £  Uad  and  two  positive  constants  C\,C2  such  that  (3.9)  and  (3.10)  hold. 

Note  that  the  constants  C\  and  C2  appearing  in  (3.9)  and  (3.10)  are  independent  of 
any  other  variable,  so  that  the  above  inequalities  provide  uniform  estimates  for  the 
time  discretization  of  the  dynamics  and  of  the  cost  functional. 

The  following  result  gives  an  estimate  of  the  L°°  error  related  to  our  time  dis¬ 
cretization.  The  proof  follows  the  lines  of  Theorem  3.1  in  Falcone-Ferretti  [11]  where 
a  similar  estimate  is  obtained  for  a  finite  dimensional  control  problem. 

Theorem  3.2.  Let  (HI),  (H2)  be  satisfied.  Then,  for  any  rj  £  K,  £  £  [to,T]  and 
0  <  At  <  T  —  t0,  there  exists  a  constant  C  >  0  such  that 

(3.11)  II  v(r],  £)-u(?7,£)  ||oo<  C  At . 

Proof.  Let  us  assume  that  there  exists  a  control  u  £  U  such  that  the  minimum  is 
attained  in  the  discrete  Dynamic  Programming  Principle  (if  not  the  same  proof  will 
work  with  slight  modifications).  Let  u  be  an  optimal  control  corresponding  to  u  such 
that  (H2)  holds  and  set  (3  =  e~AAi.  Note  that  in  the  following  calculations  we  do  not 
require  neither  u  nor  u  to  be  unique. 
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By  applying  Theorem  2.5  for  r  =  tp  and  Theorem  3.1  we  have 
v(r),Q -v(r),0  <  |  j (  f  dt  + 

(3-12)  -At  £  f(yv(tn)Mtn))e-^-to)\  + 

n= 0 

+  (3p{v(yv(tp,u),  tp)  -  v(yv(tp,  u),  tp)\. 


The  above  inequality  and  (H2)  (b)  imply 

v(r],£)  —  v(r],£)  <  pC2(At)2  -T  /3p[v(yv(tp,  u),  tp)  —  v(yri(tp,u),tp)]  + 

+  (3P[v(yy(tp,  u),tp)  —  v(jjri(tp ,  li),  tp)]  < 

<  pC2(Atf  +  (3PC  II  yr,(tp,u)  -  yr,(tp,u)  ||  + 

+  Pp[v(yv{tp,  u),  tp)  —  v(yr){tp,v),tp)  ] 

for  the  Lipschitz  continuity  of  the  value  function. 

Then,  assumption  (H2)(a)  implies 

v(rh  0  ~  v(r),  <  pC2(At)2  +  CCi(At)2 f3p  +  (3p[v(yv(tp,  u),tP)  —  v(yv(tp,  u),  tp)|  . 

Then  we  can  conclude  that 

(3.13)  (1  -  /?p)  sup  ( v(rj ,  t)  -  v(rj ,  t ))  <  pC2(At)2  +  CCi(At)2f3p  . 

vzl°° (n)  v  7 

telt0,T) 

In  the  same  way  one  can  prove  a  similar  inequality  for  sup  \v(r),t)  -  v(r],t)). 

„€Z,=»(fi)  v  ' 

is[*o  .T1 

In  conclusion,  we  get 


(3.14) 


||  v(V,T-t)-v(n,  T-t)  |U< 


PC2(Atf  +  CCi(At)2(3p 
1  -  Pp 


and  since  1  -  (3P  =  0[pAt )  this  ends  the  proof.  □ 

The  above  conditions  (HI)  and  (H2)  can  be  interpreted  as  assumptions  on  the 
order  of  approximation  of  the  time  discretization  for  the  dynamics  and  for  the  cost 
integral.  In  finite  dimensional  control  problems  one  can  also  obtain  sufficient  con¬ 
ditions  on  the  data  guaranteeing  (HI)  and  (H2)  (see  [11]).  The  inequality  (3.9)  is 
satisfied  if  the  discrete  dynamics  is  close  enough  to  the  continuous  dynamics  and 
this  of  course  will  depend  on  the  accuracy  of  the  approximation  scheme  and  on  the 
discretization  of  the  control  space  (note  that  we  are  taking  the  control  in  a  set  U 
which  stands  for  a  discretization  of  Uad)- 

Let  us  examine  in  more  detail  the  second  inequality  (3.10).  In  order  to  guarantee 
that  (3.10)  holds  true  for  the  approximation  of  our  parabolic  problem  we  need  to  know 
that  the  time  derivative  of  the  control  u  and  of  the  solution  of  the  heat  equation  exist. 
This  can  be  obtained  adding  some  regularity  assumptions  on  the  data  of  the  problem 
(see  f.e.  Theorem  2  in  [12,  p.  144]). 
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For  example,  let  us  assume  that  /  is  Lipschitz  continuous  with  respect  to  the 
couple  (y,  u)  and  that  the  space  U  is  such  that  for  any  u  £  Uad  there  exists  at  least 
one  control  u  £  U  guaranteeing 

||u  —  ■u|U2([o,rj)  <  CAf 
for  some  positive  constant  C.  Then, 

rAt\fm)MO)^-to)  -  f(yv(t),u(t))e-^dt\  < 

<  t+M  I -  f(yr,(t)Mt)))e-xi(-t0)\dt  + 

(3T5)  C  >  At 

+  [  \f(Vr,(t)Mm  |e_A(t_to)  -  e~x^\dt  < 

<  /  [Ci||u  —  2||£2([0>Tj)  +  C^At^dt  <  C(At)  . 

This  tells  us  that  it  is  important  to  built  an  accurate  discretization  of  the  control 
space  (f.e.  by  means  of  piecewise  polynomial  functions  of  time)  and  couple  this 
discretization  with  sufficiently  accurate  approximation  schemes  for  the  dynamics  and 
the  cost  to  get  the  error  bound  proved  in  Theorem  2.5. 

4.  Some  hints  for  the  algorithm 

In  order  to  solve  numerically  our  boundary  control  problem  we  use  a  local  version 
of  Dynamic  Programming  trying  to  reduce  the  huge  amount  of  computations  usually 
needed  by  that  approach.  Let  us  assume  that  we  want  to  compute  the  minimum 
over  a  subset  of  all  the  possible  trajectories,  f.e.  we  can  imagine  that  there  exists  a 
trajectory  (  starting  at  our  initial  condition  rj  and  reaching  a  neighbourhood  of  the 
final  state  z?. 

The  Bellman  optimality  principle  gives  the  characterization  of  the  value  function 
for  every  initial  condition  r)  £  H1^)  but,  in  order  to  have  a  feasible  algorithm,  we 
have  to  restrict  ourselves  to  a  compact  set  in  that  space.  One  possibility  is  to  restrict 
the  analysis  to  a  neighbourhood  of  (  enforcing  some  state  constraints  and  to  deal 
with  the  Hamilton- Jacobi  equation  associated  to  the  infinite  dimensional  problem 
with  state  constraints.  At  present  the  theory  and  the  numerical  methods  for  such 
problems  in  infinite  dimension  seem  to  be  rather  incomplete  and  unsatisfactory  so 
we  prefer  to  attack  the  problem  by  means  of  a  penalization  method.  In  practice,  we 
add  to  the  pay-off  a  (penalization)  term  rapidly  growing  outside  the  tube  around  C- 

Our  algorithm  to  compute  an  approximate  locally  optimal  trajectory  will  be  di¬ 
vided  into  two  parts.  At  first,  in  the  backward  procedure  (from  T  to  to),  we  compute 
a  sequence  of  almost  optimal  controls  and  states  guaranteeing  the  final  condition  and 
the  state  constraints.  Then,  in  the  forward  procedure  we  actually  solve  our  problem 
using  the  informations  obtained  in  the  backward  steps. 

Let  AT  be  a  positive  integer  and  let  At  =  (T  —  to)/N,  as  in  Section  3  we  consider 
a  discretization  with  time  step  At  and  we  define  tn  =  to  +  nAt.  In  order  to  simplify 
let  us  assume  that  there  are  only  to  different  controls,  i.e.  U  =  {ui, . . .  ,um}.  Note 
that  we  can  always  construct  a  discretization  of  the  control  space  Uad  leading  to  that 


62 


A  Priori  Estimates  for  the  Approximation  of  a  Parabolic  Boundary  Control  Problem 


Figure  4.1.  A  sketch  of  the  backward- forward  procedure. 

situation,  f.e.  using  piecewice  constant  or  piecewice  linear  functions  to  approximate 
an  element  in  Uad-  The  set  U  is  the  finite  dimensional  representation  of  our  original 
control  space.  A  simple  choice  is  to  take  controls  which  are  constant  on  each  time 
interval  In  =  (tn,tn+i). 

Figure  1  gives  a  sketch  of  the  algorithm.  The  thick  solid  line  represents  the  trajec¬ 
tory  (  while  the  area  contained  between  the  two  dotted  lines  represents  the  tube. 

The  backward  procedure  works  as  follows.  Starting  from  the  final  condition  zT,  we 
determine  a  new  trajectory  (possibly  different  from  £)  joining  Zt  to  an  initial  state 
(possibly  different  from  g)  at  time  tQ.  More  precisely,  starting  from  gN  =  zT,  at  each 
time  step  we  take  the  solution  gn+1  of  the  dynamics  at  time  tn+ 1,  n  =  N  —  1, 0,  as 
our  target  and  we  want  to  determine  a  state  gn  and  a  control  u  at  time  tn  such  that 
we  minimize  the  running  cost  related  to  that  displacement.  The  sequence  of  all  the 
intermediate  targets  gn  will  give  us  the  new  “trajectory”  (the  dashed  line  in  Figure 
1)  to  be  used  in  the  forward  procedure. 

In  fact,  in  the  forward  procedure  starting  from  the  initial  condition  rj  of  the  original 
problem  we  try  to  pass  through  the  intermediate  targets  </",  n  =  1, ..,  N,  always  min¬ 
imizing  the  costs  over  U.  Since  the  cost  functional  depends  continuosly  on  the  initial 
conditions  (see  [17]),  we  will  obtain  a  quasi-optimal  control  and  a  good  approxima¬ 
tion  (the  thin  solid  line  in  Figure  1)  of  the  solution  provided  the  discretization  is 
sufficiently  accurate. 

The  fully  discrete  algorithm  requires  a  further  discretization  in  space.  This  means 
that  in  the  numerical  solution  of  the  boundary  control  problem  one  has  to  couple 
an  approximation  scheme  for  the  dynamics  (which  can  be  a  black-box  solving  the 
heat  equation  for  any  initial  condition  and  piecewise  constant  boundary  control) 
and  an  approximation  scheme  for  the  cost  functional  (which  can  also  be  a  library 
routine  for  the  numerical  integration  over  ft).  In  the  backward  procedure  the  black¬ 
box  is  used  to  compute  the  solutions  corresponding  to  a  finite  number  of  initial 
conditions  and  controls,  comparing  the  results  of  the  integral  on  each  of  them  one 
can  get  a  couple  ( gn ,  un)  giving  the  optimal  discrete  value.  The  forward  procedure 
uses  the  knowledge  of  the  “intermediate  targets”  to  solve  a  sequence  of  optimization 
problems  in  the  intervals  In.  In  each  of  this  optimization  problems  one  starts  from  the 
numerical  solution  at  time  tn  and  tries  to  reach  gn+1  minimizing  the  cost.  Note  that 
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the  algorithm  computes  local  minima.  So  in  general  we  will  obtain  an  approximation 
of  the  optimal  solution  in  the  tube.  Only  if  our  tube  contains  the  optimal  solution 
for  the  unconstrained  optimal  control  problem  we  would  expect  to  converge  to  the 
globally  optimal  solution. 


5.  Appendix 

1.  We  give  for  completeness  the  proof  of  the  Dynamic  Programming  Principle 
(Theorem  2.5)  for  our  problem. 

Proof.  Let  us  denote  by  w(rj,t0)  the  right  hand  side  of  (2.23). 

We  consider  two  cases. 

1)  Let  t  =  T.  We  first  observe  that,  by  definition, 

v(yv(T),T)  =  miJ(yv(T)). 

We  have  then 

u/fo,  to)  =  inf  j£  f(yv(t;  u)Xt))e-X(t-to)dt  +  v(yv(T),T)e-x^  }  = 

=  inf  {£  «),  u(t))e~x^dt  +  V(i yv(T))e~x ^  } 

which  coincides  with  the  definition  of  the  value  function. 

2)  Let  t  <  T.  We  will  show  the  two  inequalities: 

a)  v(r),t0 )  >  w(r),t0). 

For  any  fixed  u  €  U,  we  can  write 

J(r),t0,u)=  [  f(yv(t;u),u(t))e^x{t~to}dt  + 

Jto 

+  ^7V(%(i;n),«W)e-A(t-to)dt  +  ^(%(T))e-A(T-to)  . 

Since  the  solution  of  the  heat  equation  is  unique,  setting  y,  =  yv{r;u)  we  have 
yv{t’,u)  =  y^pu),  for  every  t  >  r.  Then  the  definitions  of  the  cost  function  and 
of  the  value  function  imply 

J{ V,  to,  u)  =  [  f{yv{t-,  u),  u(f))e“A(t_to)dt  + 

Jto 

+  e-A(T-t0){  [  f(y,(t;  u),  u(t))e~x^dt  +  ,K^m)e-A(T-^}  = 

=  f  KVvit ;  u),  u{t))<rx(t-to)dt  +  e-A(r_<o)  J(^(r), r,  u)  > 

Jto 

>  [  f{yr,{t-,u),u{i))e-x{t~to)dt  +  e"A(T"to)u(?/M(r),T). 

Jto 

Taking  the  infimum  over  u  e  U,  we  prove  our  first  inequality. 

b )  v(rj,t0)  <  w(rj,t0).  For  any  e  >  0  there  exists  ue  G  U  such  that 

(5.1)  v(ij,t)+£>  J(h,t,u£) 

where  y  =  yv{T, u )  and  u  is  fixed. 
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Define 


u(x,t) 


u(x,  t )  to  <  t  <  T 
ue(x,t)  t  <t  <T. 


Then,  by  definition, 

v(r),t0)  <  J(r),t0,u)  =  f  /(&?(i;M),u(f))e~A(t“to)dt-+ 

Jto 

j '  u),u(t))e~x{t~to)dt  +  ip(yr,(T))( 


(5.2) 


+ 


A(T— to) 


The  uniqueness  of  the  solution  for  (2.1)  and  the  definition  of  u,  imply 


yr)(x,t-,u) 


yn(x,t;u)  t0<t<r 
yfj.(x,t\  u£)  t  <  t  <T  . 


By  substitution  in  (5.2),  we  get 

v(y,tQ)  <  f  f(yv(t;u),u(t))e~x{t~to)dt  + 

•/to 

+  [  f%{t\ «e),  u£(t))e~Xit-t0)dt  +  = 

=  /  f{yr,(t;u},u(t))e~x{t-to)dt  +  e~xiT~to){J(iJ,,T,ue)}  < 

Jto 

<  [  f(yv(t]u),u(t))e~x{t~to)dt  +  e~A(T“*o)  (w(//,r)  +  e)  . 

Jto 

Taking  the  infimum  over  u  £  U,  by  the  arbitrariness  of  £  we  get  the  reverse 
inequality.  This  ends  the  proof.  □ 


Note  that  one  of  the  crucial  requirements  for  the  proof  is  the  fact  that  the  set  of 
controls  is  “closed  by  concatenation”,  i.e.  if  two  controls  Ui  and  belong  to  Uaci 
then,  for  any  r  £  [to,T],  also  the  control 


u(x,t) 


ui(x,t)  to  <  t  <  T 
u2(x,t)  t  <t  <T 


belongs  to  the  same  space  Uad.  As  a  consequence  the  Dynamic  Programming  Principle 
holds  also  under  the  restrictions  (2.26)  and  (2.27)  on  the  controls  and  on  the  initial 
data. 


2.  Here  is  the  statement  in  Corollary  III. 20  in  [6] 


Theorem  5.1.  Let  E  be  a  reflexive  Banach  space,  A  C  E  be  a  closed,  bounded,  convex 
subset  and  the  function  (f  :  A  — >]  —  oo,  Too]  be  convex  and  lower  semicontinuous. 
Then,  there  exists  xo  £  A  such  that  <p(x0)  =  min  <f>(x) . 
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Abstract.  We  formally  state  and  prove  the  wellposedness  and  the  local  Lipschitz  con¬ 
tinuity  of  the  multisurface  stress-strain  law  of  nonlinear  kinematic  hardening  type  due  to 
Chaboche  within  the  space  of  time-dependent  tensor-valued  absolutely  continuous  func¬ 
tions.  The  results  also  include  the  more  general  case  of  a  continuous  family  of  auxiliary 
surfaces. 

1991  Mathematics  Subject  Classification.  47H30,  73E05 
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1.  Introduction 

In  rate  independent  plasticity,  the  Prandtl-Reufi  model  constitutes  the  basic  model 
for  the  stress-strain  law.  Here,  the  elastic  region  Z  is  bounded  by  a  yield  surface 
dZ .  Throughout  this  paper,  we  will  assume  the  yield  surface  to  be  a  sphere  of  radius 
r  in  the  space  of  deviatoric  stresses.  If  loading  occurs  while  the  stress  deviator  ad 
lies  on  the  yield  surface,  there  is  plastic  flow  with  a  plastic  strain  rate  ip  proportional 
to  the  outer  normal  to  dZ  in  ad .  It  has  been  known  from  experiments  for  a  long 
time  that  for  many  materials  the  yield  surface  undergoes  changes  which  depend  upon 
the  history  of  the  loading  process.  In  the  Melan-Prager  model  which  dates  back  to 
[12],  [13],  nowadays  called  linear  kinematic  hardening,  the  yield  surface  moves  during 
plastic  loading  in  the  direction  of  the  plastic  strain  rate.  More  sophisticated  models 
have  been  developed  to  account  for  real  material  behaviour,  in  particular  for  the 
phenomenon  called  ratchetting.  Among  those,  the  Chaboche  model  [10],  also  called 
nonlinear  kinematic  hardening,  enjoys  a  widespread  popularity.  In  its  standard  form, 
it  employs  a  finite  family  of  auxiliary  spherical  surfaces.  In  the  special  case  of  a  single 
auxiliary  surface,  assumed  to  be  centered  at  0  with  radius  R ,  the  model  is  known 
as  the  Armstrong-Frederick  model  [1];  here,  the  center  ab  of  the  yield  surface,  also 
termed  the  backstress,  moves  according  to  the  differential  equation 

(1.1)  &b  =  j(Rip-ab\ip\)  , 
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for  some  constant  7  >  0 ,  see  Figure  1.  (In  the  Melan-Prager  model,  the  term  -ab\ip 
is  omitted.) 


In  the  Chaboche  model,  the  backstress  ab  is  decomposed  into  a  sum 

(1-2)  ob  =  Y,aki 

kei 

where  each  constituent  ak  satisfies  an  equation  of  type  (1.1),  namely 
(1.3)  ab  =  1(k)(R(k)ep-ab\ip\)  ,  he  I. 

In  the  standard  Chaboche  model,  the  index  set  I  is  finite;  we  will  allow  an  arbitrary 
measure  space  and  thus  include  the  case  of  a  continuous  family  of  auxiliary  surfaces. 

Figure  2  shows  the  rheological  structure  of  the  model.  It  visualizes  the  relations 
between  the  various  variables  which  occur  in  the  model,  stated  formally  in  (2.5) 
-  (2.12)  below.  The  element  £  refers  to  the  linear  elastic  part,  TZ  is  called  the 
rigid  plastic  element  and  represents  the  variational  inequality,  and  Kk  is  the  element 
defined  by  (1.3).  The  element  C  plays  a  special  role;  it  stands  for  the  linear  element 
a1  =  Clsv  of  the  Melan-Prager  model.  It  may  or  may  not  be  included  within  the 
Chaboche  model,  but  its  presence  or  absence  influences  the  asymptotic  behaviour  (see 
e.g.  [7]).  If  we  remove  all  nonlinear  elements  Kk  in  Figure  2,  we  obtain  the  Melan- 
Prager  model.  If  we  moreover  delete  the  element  C ,  we  arrive  at  the  Prandtl-Reufl 
model. 

In  this  paper,  we  prove  that  the  Chaboche  model  is  well  posed  in  the  space  W1,1 
both  in  the  stress  controlled  and  in  the  strain  controlled  case  by  proving  that  the 
defining  equations  and  inequalities  of  the  Chaboche  model  (see  (2.5)-(2.12)  below) 
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lead  to  operators 

(1.4)  e  =  f{a) ,  a  =  Q{e) , 

which  are  well  defined  and  Lipschitz  continuous  on  their  appropriate  domains  of 
definition.  In  doing  this,  we  consider  the  stress-strain  law  in  isolation,  that  is,  we  do 
not  study  the  boundary  value  problems  which  arise  from  the  coupling  with  the  balance 
equations.  For  the  proof  we  utilize  the  method  of  [2].  There  we  have  introduced 
an  auxiliary  variable  u  in  order  to  reformulate  the  model  equations  such  that  the 
unknown  functions  of  Figure  2  appear  only  in  terms  of  |ep|  and  avd .  The  analysis 
is  based  on  the  concept  of  hysteresis  operators,  that  is,  of  operators  which  are  rate- 
independent  as  well  as  causal,  see  e.g.  [14],  [8],  [9],  [3]. 


jC  :  er,  a1 


11:  £p,  crp 

Figure  2:  The  rheological  structure  of  the  Chaboche  model. 


2.  Model  Formulation  and  Main  Result 

We  first  fix  some  basic  tensor  notation.  By  T ,  we  denote  the  space  of  symmetric 
N  x  N  tensors  endowed  with  the  usual  scalar  product  and  the  associated  norm 


*j= 1 

For  t  G  T ,  we  define  its  trace  Tr  r  and  its  deviator  rd  by 

N  Trr 

(2.2)  Ttt  =  =  (t,S)  ,  Td  =  T-—  6, 

i= 1 

where  6  =  ( 6% )  stands  for  the  Kronecker  symbol.  We  denote  by 

(2.3)  Td  =  (r  :  t  €  T ,  Tr  t  =  0}  ,  =  {r  :  r  =  X6 ,  A  G  M)  , 
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the  space  of  all  deviators  respectively  its  orthogonal  complement.  We  understand 
stress  and  strain  as  time-dependent  tensor-valued  functions  which  are  absolutely 
continuous, 

(2.4)  a,  e  G  W1,1(t0>ti;T) 

:=  {r|r  :  [f0,ti]  -*•  T ,  |M|i,i  =  |'r(*o)|  +  /  *  |f(t)|  dt  <  00}  . 

As  we  study  the  stress-strain  law  in  isolation,  we  do  not  consider  the  space  depen¬ 
dence.  In  this  terminology,  the  Chaboche  model  takes  on  the  form 


(2.5) 

a  =  ab  +  ap,  e  =  £e  +  ep ,  ep(t)  G  Td  Vt 

(2.6) 

(£p,ng-d)>0,  VdGTd,  \d\  <r, 

(2.7) 

v. 

VI 

(2.8) 

cr  =  Aee , 

(2.9) 

crh(t)  =  J  <rbk{t )  di/(fc)  +  vla\t)  V  t , 

(2.10) 

=  7(fc)  ( R{k)ip  -  Ufc|ep|)  ,  for  all  k  G  / , 

(2.11) 

a1  =  Clep, 

(2.12) 

ap(t0)  =  erg  ,  <r£ (t0)  =  ao(k) ,  for  all  k  G  / 

Throughout  this  paper,  we  assume  the  data  to  have  the  following  properties. 

Assumption  2.1. 

(i)  I  is  a  measure  space,  v  is  a  finite  nonnegative  measure  on  I ,  the  numbers  ir,  Cl 
and  functions  R  G  L\,{I) ,  7  G  L“(/)  satisfy  vl,R,  7  >  0,  Cl  >  0,  fj  R{k)du{k)  > 
0  and 

(2.13)  0  <  7 min  <  7 (k)  <  7max  ,  for  all  k  G  /  . 

(ii)  The  initial  values  in  (2.12)  satisfy 

(2.14)  ug  G  T£  =  {t  :  r  G  T,  |rd|  <  r}  , 

(2.15)  <Jb0eTb  =  {/|/  G  Li(/;  Td) ,  |/(fc)|  <  R(k)  a.e.}  . 

(Hi)  A  :  T  — *  T  is  linear,  symmetric  and  positive  definite. 

We  also  introduce  the  constants 

(2.16)  T  i  =  f1{k)iR(k)dv(k),  *  =  0,1, 2, 3. 
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Remark  2.2. 

(i)  If  the  index  set  I  is  finite,  say  /  =  {1, . . .  ,K},  and  if  v  is  chosen  to  be  the 
counting  measure,  that  is,  v{  J)  equals  the  number  of  elements  in  J  for  every  subset 
J  of  / ,  then  we  obtain  the  standard  formulation  of  the  multisurface  Chaboche  model 
with  K  auxiliary  (limiting)  surfaces,  namely 

(2.17)  ^  =  X>£- 

fc=i 

In  this  case,  the  model  (2.5)  -  (2.12)  is  identical  with  the  one  discussed  in  ([10], 
Section  5.4.4),  nonlinear  kinematic  case,  if  we  change  the  notation  according  to 

(2.18)  k  =  l,  a bk=Xi,  7(fc)  =  ^| 'y(k)R(k)  =  |ci . 

(ii)  If  we  have  K  =  1  in  (i),  or  if  we  choose  7 (k)  =  7  and  R(k)  =  R/v(I)  to  be 
constant,  the  Chaboche  model  reduces  to  the  model  of  Armstrong  and  Frederick  [1] 

(2.19)  &b  =  j(Rip  -  ab\ep\) . 

(iii)  If  dv(k)  =  g(k)d\(k)  for  some  function  g ,  that  is,  if  the  measure  v  has  a  density 
with  respect  to  the  Lebesgue  measure  A ,  we  obtain  a  version  of  the  Chaboche  model 
with  a  continuous  one  parameter  family  of  backstresses  respectively  auxiliary  surfaces. 

We  formulate  our  main  results.  For  the  strain  controlled  case,  we  assume  Hooke’s 
law  for  the  linear. elastic  part,  that  is, 

(2.20)  Ae  =  2ge  +  ATY  (e)<5 , 
where  A,  g  >  0  denote  the  Lame  constants. 

Theorem  2.3  (Wellposedness,  Strain  Controlled  Case). 

Let  Assumption  2.1  as  well  as  (2.2Q)  hold.  Then  the  system  (2.5)-(2.12)  defines  an 
operator 

(2.21)  o  —  Q{s\  o’q,  u”q)  , 

(2.22)  g  :  W^ito,  tu  T)  x  I?  x  Tb  -»  W^{t0,  ti;  T) , 
which  satisfies  the  Lipschitz  condition 

(2.23)  ||  G(e;  erg,  erg)  -  Q(e;  og,  dg) 

<  L{K)  (||  e  -  e  +  |crg  -  dg|  +  ||ag  -  dg||Li(/;Td))  , 

where  the  Lipschitz  constant  is  uniform  over  subsets  {(e, (To,crg)  :  || er  x  <  K}  of 
the  domain  of  definition  of  Q  . 

We  now  consider  the  stress  controlled  case.  If  vl  =  0 ,  that  is,  if  the  Melan-Prager 
element  is  absent,  our  choice  (2.12)  of  initial  conditions  restricts  the  initial  value 
<7 (t0)  of  the  stress;  on  the  other  hand,  there  has  to  be  an  initial  condition 

(2.24)  ep(f0)  =  eva 
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for  the  plastic  strain.  This  setting  also  works  for  the  case  vl  >  0 ,  the  restriction 
being 


a(t0)  =  <  + 1  (k)dis(k)  +  ulCl£p0  . 


In  the  case  ul  =  0 ,  the  description  of  the  domains  where  the  Lipschitz  constant  is 
uniform  involves  the  number 

(2.26)  +  !>=• 

7min  1  2 

Theorem  2.4  (Wellposedness,  Stress  Controlled  Case). 

Let  Assumption  2.1  hold. 

(Case  vl  >  0.)  The  system  (2.5)-(2.12),  (2.24)  defines  an  operator 

(2.27)  £  =  f(a;ag,*b0,£p0),  T  :  Da  -+  W1'1(f0,*i;T) , 

where  Da  C  X;T)  xl’xTxTj  is  the  subset  of  quadruples  which  satisfy 

(2.25).  Moreover,  fF  satisfies  on  Da  the  Lipschitz  condition 

(2.28)  II  Her,  erg,  erg,  eg)  -  -F(d;  dg,  dg,  eg)  ||1(1 

<  L(K)  (||  CT  —  d  ||M  +  |crg  -  dg|  +  ||erg  -  ^olUl(/;Td)  +  l£0  ~  £ol)  > 


where  the  Lipschitz  constant  is  uniform  over  subsets  {(u,  erg,  erg,  eg)  :  Herll^j  <  K} 
of  the  domain  of  definition  of  T  ■ 

(Case  ul  =  0.)  For  every  k  >  0 ,  let  DaA  be  the  subset  of  Da  where  the  two 
conditions 


J  7(/c)erg(fc)eh/(fc)  <  T^l  —  k)  , 


(2.30)  ||  crd  <  T0  +  r  -  Ti(3k  , 

hold,  the  number  (5  being  defined  in  (2.26).  Then  T  has  the  properties  as  stated 
above  on  the  domains  D0iK  instead  of  Da ;  in  particular,  the  Lipschitz  constant  also 
depends  on  k  . 

A  well  known  example  (see  [10],  or  Example  3.5  in  [2])  shows  that  the  bound 
II  ad  Hoc  <  T0  +  r  in  (2.30)  cannot  be  improved. 

The  basic  idea  of  the  proof  of  the  two  theorems  above  is  the  same  as  in  [2].  We 
replace  the  two  unknown  functions  ep  and  erg  by  a  single  auxiliary  function  u , 
namely 

(2.31)  u  =  Cep  +  erg , 

where  C  >  0  is  a  suitably  chosen  constant.  In  fact,  both  functions  ep  and  erg  can 
be  expressed  as 

(2.32)  ep  =  u  ;  ergd) ,  erg  =  S(u ;  crgd) . 
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Here,  the  stop  operator  S  represents  the  solution  of  the  evolution  variational  in¬ 
equality 

(2.33)  K|<r,  {u-opd,ovd-c)  >  0  a.e.  V|a|<r, 

with  the  initial  condition 
(2-34)  Od(fo)  =  &od ) 

and  the  play  operator  V  is  defined  by 

(2.35)  V{u\ald)=u-S(u-ald). 

We  refer  to  [2]  and  [9]  for  more  details.  We  now  derive  a  differential  equation  for  u 
where  the  internal  variables  crl ,  ed,  ep  appear  only  in  terms  of  crd  and  \ip\ .  In 
the  stress  controlled  case,  we  set 

(2.36)  C  =  I\  +  vlCl . 

Using  the  model  equations,  we  obtain 

(2.37)  u  =  (rx  +  ulCl)ip  +  &pd  =  (r\  +  vlCl)ep  +  &d-  ab 

=  Od,  + f^Woidvtk)^. 

In  the  strain  controlled  case,  where  we  have  assumed  Hooke’s  law  (2.20)  for  the  linear 
elastic  part,  the  backstress  ab  satisfies 

(2.38)  ad  =  2ned ,  ab  =  2 fj,ed  —  (2 fi£p  +  avd) . 

Here,  we  set  the  constant  C  in  (2.31)  to 

(2.39)  C  =  2fJ,  +  r1  +  vlCl , 
and  obtain 

(2.40)  u  =  (2li  +  rl  +  plCl)ip  +  &pd  =  2fj,ip  +  2nied-ab  +  {r1+i/Cl)£p 

=  2fi£d  +  J  l{k)crbk  dv(k)  |ep| . 

As  it  is  well  known,  one  can  easily  eliminate  the  unknowns  'with  the  variations  of 
constants  formula.  Using  the  basic  identity 

P 

(2.41)  ip  =  —  |ep| , 

r 

the  differential  equation  (2.10)  for  the  backstresses  becomes 

(2.42)  ^  =  kel- 

For  later  use,  we  will  write  down  the  solution  formula  in  terms  of  the  play  and  stop 
operator  with  the  abbreviated  notation 

(2.43)  £  =  V{u]apad) ,  x  =  S(u;cr%d),  £,  x  :  [t0,  H]  -»  Td . 
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The  function 

(2.44)  V(t)  =  Var[w|£  (=  jf  l&OI  dr,  if  £  e  W1-1(f0,ti;Td)) 
represents  the  accumulated  plastic  strain,  scaled  by  a  constant  factor.  If  we  set 

(2.45)  '  Wk(t)  =  exp  , 

the  backstresses  can  be  expressed  as 

(2.46)  al(t)  =  exp  (-2®  V(l))  (<4(‘)  +  £  “W)  ■ 

Thus,  for  the  stress  as  well  as  for  the  strain  controlled  case,  the  auxiliary  function  u 
satisfies  the  equation 

(2.47)  u  =  9  +  M(u;a%,ab0)\i\, 
where  6  =  04  respectively  6  =  2 \iei , 

(2.48)  M{u\  erg,  ab0)(t)  =  i  jf  l(k)obk{t)  du{k ) , 

and  (2.43)  -  (2.46)  are  used  to  express  er|  in  terms  of  the  arguments  of  M  .  Equation 
(2.47)  is  complemented  by  the  initial  condition 

(2.49)  u(to)  =  Cep(to)  +  crpd  . 

In  the  stress  controlled  case,  ep{t0 )  is  prescribed,  whereas  in  the  strain  controlled 
case,  it  can  be  expressed  in  terms  of  the  given  data  by  (2.38). 

Once  the  auxiliary  equation  (2.47)  is  solved,  we  can  express  the  operators  F,  Q  in 
terms  of  u ,  namely 

(2.50)  £  =  Tip ,  erg,  erg,  eg)  =  se  +  ep  =  A~ la  +  ^P(u ;  crgd) , 

2u 

(2.51)  cr  =  Q{e,  erg,  <rg)  =  A{e  -  ep)  =  As  -  i  tfd)  ■ 

3.  Proof  of  the  Wellposedness 

The  wellposedness  of  the  initial  value  problem 

(3.1)  ii(t)  =  0(t)  +  M(u;a^a^)(t)li(t)l,  £(f)  =  P(u;o%d)(t) , 

(3.2)  u(t0)  =  u° . 

has  been  studied  in  [2]  concerning  the  dependence  on  9 ;  the  dependence  on  the  initial 
conditions  ( u° ,  erg,  erg )  does  not  pose  any  new  problems.  For  the  convenience  of  the 
reader,  we  repeat  the  formulation  of  the  existence  theorem,  adapted  to  the  present 


case. 
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Theorem  3.1.  Let  9  G  ,  u°  G  Td  and  an  operator 

(3.3)  M  :  C([to,ii];Td)  x  Tf  x  Tb  -> 

be  given.  Assume  that  A4(-;  aft,  <Jq)  is  causal  and  continuous  with  respect  to  the 
maximum  norm  for  all  erg  G  TT£  and  Ug  G  T6 ,  and  that  k  >  0, Og  G  1^,  dg  G  T6  and 
u°  G  Td  are  given  suc/i  that 

(3.4)  sup  \M(u\  erg,  <7q)|  <  1  —  k 

rejto.t ] 

holds  for  all  t  G  [to,fi]  and  all  u  G  W1,1(fo,t;Td)  with  u(to)  =  u°  and 

(3.5)  |u(r)|  <  —  |0(r)| ,  a.e.  in  (to,t) . 

AC 

TTien  there  exists  a  solution  (v,  £)  of  the  Cauchy  problem  (3.1),  (3.2)  where  the 
functions  u,  £  G  W'1,1(t0,t1; Td)  fulfil  (3A)  and  (3.5).  Moreover,  every  such  solution 
which  satisfies  (3.4)  also  satisfies  (3.5). 

Proof.  See  [2],  Theorem  3.2.  □ 

Lemma  3.2.  The  operator  M(-\  <7g,  <Tg)  as  defined  in  ( 2.43 )  -  ( 2.48 )  is  causal  and 
continuous  on  (7([fg,  fi] ;  Td)  for  all  <7g  G  T£  and  Ug  G  Tb .  The  backstresses  ok 
satisfy  the  a  priori  estimate 

(3.6)  \obk(t)\  <  R(k) ,  a.e.  in  (to,  ti) , 
for  all  k  G  / . 

Proof.  The  estimate  (3.6)  follows  from  the  variations  of  constants  formula  (2.46),  since 
|a;(£)|  <  r  and  |<r^(/c) |  <  R(k)  hold  for  all  t  and  k.  Let  now  un  G  C([t0, ti]; Td) 
converge  uniformly  to  u  G  C'([to,ti];Td) .  It  is  known  (see  [9])  that 

(3.7)  £n  =  P(un  ;  cr£d)  ->  Z  =  V(u;  ogd) ,  xn  =  S{un ;  ap0d)  x  =  S{u]  o%d) , 


(3-8)  Vn(t)  =  Var[t0)t]£n  ->  V(t)  =  Var[(0it]£ , 

uniformly  on  [to,fi] .  An  application  of  Lebesgue’s  dominated  convergence  theorem 
yields  the  assertion.  □ 

We  now  discuss  the  boundedness  property  (3.4).  By  the  definition  of  M.  in  (2.48), 
the  estimate  (3.6)  yields 


(3.9) 


||  M(u;ag,a0)  ^  <  - 


ri 


c  ’ 


so  (3.4)  holds  for  all  arguments,  regardless  of  (3.5),  with 


(3.10) 


vlCl 


K  = 


respectively  k  = 


2p  +  vlCl 


c  ’  . "  C 

in  the  stress  respectively  strain  controlled  case.  Thus,  the  existence  of  a  solution  of 
(3.1),  (3.2)  follows  for  the  strain  controlled  case  and,  if  in  addition  vl  >  0,  also  for 
the  stress  controlled  case. 
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Existence  proof;  for  the  stress  controlled  case  with  vl  =  0.  Let  k  >  0.  According 
to  Thcorem  J2.4,  .we  want  to  prove  existence  for  initial  conditions  satisfying 

(3.11)  a(t0)  -=  eg  +  J  ab0{k)du(k ) , 

(3.12)  J i(k)do(k)di>(k)^  <  r\(l  -  k)  , 
and  for  stress:inputs  ad  =  0  €  W1;1(ti o,ti;Td)  satisfying 

(3.13)  II  °d  Hoc  <  T0  +  r  -  Ttfn ,  p=^-  +  r-^. 

Let  such  a  <jd  !be  given,  choose  rj  >  0  small  enough  such  that 
.pt+ri  K2r 

(3.14)  /  \<rd{r)\dT  <  - - ,  V  t  6  [t0,  h  -  r?] . 

J  t  o'Yxnax 

In  the  first  step  we  will  prove  that,  if  we  have  a  solution  u  of  (3.1),  (3.2)  satisfying 
(3.4)  on  [t0,  o].,  then  it  can  be  extended  to  [a,  a  +  r)},  and  every  such  extension  u 
satisfies 

(3.15)  ||A4(u;ct?,<x£)||o0<1-^ 
on  [a,  a  +  rj\,  and 

2 

(3.16)  ||«(t)|  <  -|dd(t)| ,  a.e.  on  (0,0  +  77). 

K 

To  this  end,  let  iu  £  a  +  r, ?;  Td)  be  an  arbitrary  function  which  satisfies  (3.16) 

as  well  as  u(a)  =  setting  u  =  u  on  [t0,  a]  we  may  regard  it  as  an  element  of 
W1'1(t0,a  +  «s  well.  Prom  the  variation  of  constants  formula  (2.46),  applied 

on  the  interval  f  f),,  we  obtain 

(3.17)  |ofti)  -4(41 

<  ^1  — exp  -  V(a^  j  *  (|^(o)|  +  R(k)) ,  t  e  [a,  a  +  rfi , 

for  the  corresponding  baekstresses.  Since 

(3.18)  \V( t)-V{a)\<f  \*ir)\dr, 

J  a 

we  get 

(3.19)  \crk{t)  —  rrjfc(a)|  <  2R{kj^  £  |4(r)|  dr  <  Mr)|  dr 

<  %m, 

so 

(3.20)  \M{u-,al,ab0){t)  -  M{u]al,ab0){a)\  <  |. 

Thus,  the  assumption  |A4(u;ao,0o)(a)|  <  1— k  implies  that  (3.15)  holds  if  u  satisfies 
(3.16).  We  may  therefore  apply  Theorem  3.1  on  the  interval  [a, a  +  rj\  to  conclude 
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the  first  step  of  the  proof.  In  the  second  step,  we  use  (3.13)  to  show  that  (3.15)  can 
be  improved  to 

(3.21)  \M{u;aQ,<jQ)(t)\  <  1  —  k,  \f  t  £[a,a  +  r}]. 

In  fact,  if  (3.21)  does  not  hold,  then  there  must  exist  a  t  £  (a,  a  +  rj)  such  that 

(3.22)  \M(u-,a%,<Tb0)(t)\  >  1  -k,  ^  (|M(u; erg, ^)(t)|2)  >  0 . 

Let  us  define 

(3.23)  a  =  |At(u;<jg,f7o)(t)| ,  e  =  ^M(u\ol,<jb0)(t) 
then  obviously 

(3.24)  0  <  1  —  a  <  k,  |e|  =  1 . 

The  choice  of  t  implies  that 

(3.25)  0  <  ~  (|A4(fi;oS,ffJJ)(f)|2)  =  ^-Jy(k){&bk{t),e)dv{k) 

=  |rl£(t)|  1 7 (k)*{y^-x(t)  -  abk(t),e)  du(k) , 
so  in  particular  |£(f)|  >0  and  therefore 

(3.26)  Jy(k)2(crbk{t),e)dv(k)  <  y(x{t),e) 

=  y  e)  -  jyk{t), e)  dv{k)sj  , 

hence 

(3.27)  jf  (l{kf  +  y)  (<fk(t),e)  dv{k )  <  y  ||  ad  . 

On  the  other  hand,  the  a  priori  estimate  |cr£(t)|  <  R(k)  shows  that 

(3.28)  0  <  1  —  a  =  —  J  l{k){R{k)e  -  ak(t ),  e)  dv(k)  <  k  , 
hence  the  definition  of  f3  in  (3.13)  yields 

(3-29)  j  (i  +  |r7(&)2)  (R(k)e  -  abk(t),  e)  dv{k) 

<  (3  /  7 (k)(R(k)e  —  crbk(t),  e)  dv{k)  <  (3T\k  , 

and  therefore 

(3.30)  ||  <rd  IU  >  /  (l  +  ^7(*)2)  R(k)  dv(k ) 

-  J  (f  +  frrr(k)2'')  (R(k)e  -  crbk(t),  e)  dv{k)  >T0  +  r-  (3Yik  , 
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which  contradicts  our  assumption  (3.13).  Thus,  such  a  t  cannot  exist,  and  the  second 
step  is  proved.  Applying  the  two  steps  in  an  alternate  fashion  we  are  able  to  cover 
the  whole  interval  [t0,ti] ,  thus  completing  the  existence  proof.  □ 

Proof  of  uniqueness  and  Lipschitz  continuous  dependence.  We  combine  a  Gronwall 
type  argument  with  the  Lipschitz  continuity  property  of  the  hysteresis  operators  V 
and  S .  As  the  arguments  are  essentially  the  same  as  for  the  single  surface  case,  i.e. 
the  model  of  Armstrong  and  Frederick,  we  can  use  the  results  of  [2]  to  a  large  extent. 

Proposition  3.3.  Let  two  sets  of  data  (Q\,  of0,  a\0) ,  (92,  u2,  °20>  °2o)  w 9i  E& , 
iiJeX,  <jf0  e  T£  and  of0  €  Tb  be  given,  let  («i,fi)  and  (u2,6)  be  corresponding 
solutions  in  W1,1(to>£i;Td)  of  the  Cauchy  problem  (3.1),  (3-2)  which  satisfy  (3 A) 
and  (3.5).  Assume  that 

(3.31)  max  of0,  uj0)(s)  -  ■A4(u2;<4)>02o)(s)l 

se\to(t] 

<  A^Ko  -  <41  +  Ikio  -  a2ohl(.iJd)  +  lu?  -  ul\  +  Jto  l«i  -  «al  rfs) 

holds  for  all  t  E  [toAi]  •  Then  there  holds 

(3.32)  ||  Ui  —  U2  ||^i 

<  L  (|u?  -  u\ I  +  Ko  -  <41  +  ll<4  -  °2olUi(r;T d)  +  II  ^1  -  lli.i)  , 

where  L  depends  only  upon  A,  K,r  and 

(3.33)  c  :=  max{||  9\  ||M,  ||02|li,i}- 

Proof.  See  Theorem  3.3  in  [2] .  □ 

The  operator  A4  as  defined  by  (2.43)  -  (2.48)  satisfies 

(3.34)  \M{ui,op10,<j\0){t)  -  M{u2,a%0,ab20){t)\ 

<  jf  7(Jfe)|o?0(*)  -  <4>(*)l Mk)  +  (^T  +  §  /o  ds )  jo  l^1 

as  a  repeated  use  of  the  triangle  inequality  as  well  as  of  the  inequality  |  exp(— t)  — 
exp(— s)|  <  \t  -  s| ,  valid  for  t,  s  >  0 ,  shows.  It  was  proved  in  [2],  Theorem  A.5,  that 

(3.35)  f  |£i  —  62I ds  <  |criod  —  <4dl  +  /  |w  -u2\ds-\  —  f  \ui\\x\  —  x2\ds 

Jto  Jto  '  Jto 

holds.  Moreover,  by  the  standard  uniqueness  argument  for  variational  inequalities 
(see  also  Proposition  A.l  in  [2]),  one  has 

(3.36)  \xi{t)  ~x2(t)\<\a[0d-a%0d\+ [  jiq  -u2\ds. 

Putting  together  the  estimates  (3.34)  -  (3.36),  one  sees  that  M  satisfies  the  assump¬ 
tion  (3.31)  with  some  constant  A  which  depends  only  on  ||  14  ||u,  ||  u2  ||ltl  and  on 
the  problem  data.  Therefore  the  Lipschitz  estimate  (3.32)  holds  for  the  difference 
Uj  -u2  of  the  two  solutions.  It  extends  to  all  the  unknown  functions  in  the  Chaboche 
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model,  since  they  can  be  expressed  in  terms  of  u  and  £  as  shown  at  the  end  of  Sec¬ 
tion  2,  both  for  the  stress  controlled  and  the  strain  controlled  case.  Thus,  the  proof 
of  Theorems  2.3  and  2.4  is  complete. 

References 

1.  P.J.  ARMSTRONG  and  C.O.  FREDERICK,  1966,  A  mathematical  representation  of  the  mul- 
tiaxial  Bauschinger  effect,  C.E.G.B.,  Report  RD/B/N  731. 

2.  M.  BROKATE,  P.  KREJCI,  Wellposedness  of  kinematic  hardening  models  in  elastoplasticity, 
Math.  Model.  Numer.  Anal,  to  appear. 

3.  M.  BROKATE,  J.  SPREKELS,  1996,  Hysteresis  and  phase  transitions,  Springer- Verlag,  Berlin. 

4.  J.-L.  CHABOCHE,  1989,  Constitutive  equations  for  cyclic  plasticity  and  cyclic  viscoplasticity, 
Int.  J.  Plasticity ,  5 ,  pp.  247-302. 

5.  J.-L.  CHABOCHE,  1991,  On  some  modifications  of  kinematic  hardening  to  improve  the  de¬ 
scription  of  ratchetting  effects,  Int.  J.  Plasticity,  7,  pp.  661-678. 

6.  J.-L.  CHABOCHE,  1994,  Modeling  of  ratchetting:  evaluation  of  various  approaches,  Eur.  J. 
Mech.,  A/Solids,  13,  pp.  501-518. 

7.  M.  KAMLAH,  M.  KORZEN  and  CH.  TSAKMAKIS,  Uniaxial  ratchetting  in  rate-independent 
plasticity  laws,  Acta  Mechanica,  to  appear. 

8.  M.A.  KRASNOSEL’SKII  and  A.V.  POKROVSKII,  1989,  Systems  with  hysteresis,  Springer- 
Verlag,  Berlin.  Russian  edition:  Nauka,  Moscow  1983. 

9.  P.  KREJCI,  1996,  Hysteresis,  convexity  and  dissipation  in  hyperbolic  equations,  Gakkotosho, 
Tokyo. 

10.  J.  LEMAITRE  and  J.-L.  CHABOCHE,  1990,  Mechanics  of  solid  materials,  Cambridge  Univer¬ 
sity  Press,  Cambridge  1990.  French  edition:  Dunod,  Paris  1985. 

11.  G.A.  MAUGIN,  1992,  The  thermomechanics  of  plasticity  and  fracture,  Cambridge  University 
Press,  Cambridge  1992. 

12.  E.  MELAN,  1938,  Zur  Plastizitat  des  raumlichen  Kontinuums,  Ingenieur-Archiv,  9 ,  116-126. 

13.  W.  PRAGER,  1949,  Recent  developments  in  the  mathematical  theory  of  plasticity,  J.  Appl. 
Phys.,  20,  pp.  235-241. 

14.  A.  VISINTIN,  1994,  Differential  models  of  hysteresis,  Springer- Verlag,  Berlin. 


Martin  Brokate 
Mathematisches  Seminar 
Universitat  Kiel 
D-24098  Kiel,  Germany 


Pavel  Krejcf 

Institute  of  Mathematics 
Academy  of  Sciences 
Zitna  25 

CZ- 11567  Praha,  Czech  Republic 


International  Series  of  Numerical  Mathematics 
Vol.  126,  ©  1998  Birkhauser  Verlag,  Basel 


81 


On  the  Behaviour  of  the  Value  Function  of  a  Mayer 
Optimal  Control  Problem  along  Optimal  Trajectories 

PIERMARCO  CANNARSA  AND  MARIA  ELISABETTA  TESSITORE 

Dipartimento  di  Matematica 
Universita  di  Roma  “Tor  Vergata” 

Abstract.  We  consider  a  Mayer  optimal  control  problem  for  a  system  governed  by  a 
semilinear  evolution  equation  of  parabolic  type. 

We  are  interested  in  the  smoothness  of  the  related  value  function  V  along  an  optimal 
trajectory  £*(■)•  We  obtain  an  estimate  on  the  superdifferential  of  V  at  (t,x*(t))  which 
states  that 

dimD+V{t,x*(t))  <  1. 

This  result  may  also  be  regarded  as  a  necessary  condition  for  optimality. 

1991  Mathematics  Subject  Classification.  49K20,  49L10 

Key  words  and  phrases.  Mayer  problem,  value  function,  semi-concavity,  optimality  condi¬ 
tion. 


1.  Introduction 


In  this  paper  we  are  concerned  with  the  following  optimal  control  problem: 


(1.1)  minimize  g(x(T )) 

over  all  trajectory-control  pairs  {re,  7},  subject  to  the  semilinear  state  equation 

q  2n  f  x'(t)  +  Ax(t)  +  f{t,  x(t),  7(f))  =  0,  t  G  [t0,  T] 

'  '  }  \  x(t0)  =  x0 

Here,  x0  belongs  to  a  real  Hilbert  space  X,  t0  G  [0,  T]  and  —A  is  the  infinitesimal 
generator  of  an  analytic  semigroup.  For  simplicity,  we  assume  that  A  is  self-adjoint. 

A  control  7*(-)  is  said  to  be  optimal,  if  the  minimum  in  (1.1)  is  attained  at  7*;  the 
corresponding  trajectory  x*(-)  is  said  to  be  an  optimal  trajectory. 

The  value  function  V  of  problem  (1.1)— (1.2)  is  defined  as 

(1.3)  V(t0,x0)  =  inf{g(a:(T))|{a;,7}subject  to  (1.2)} 

and  satisfies  the  Dynamic  Programming  (or  Hamilton-Jacobi-Bellman)  equation 

-dtV (t,  x )  -I-  (Dx V ( t ,  x),  Ax)  +  H(t,  x,  DXV (: t ,  x))  =  0 

(1-4) 

Vt  e  (0,T),  VxeX 

see  [7].  In  the  above  equation,  H  :  [0,  T\  x  X  x  X  — ►  R  is  the  Hamiltonian  defined  as 
(1.5)  H(t,x,p)  =  sup -(p,/(t,  a:,  7)). 
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It  is  well  known  that  V  is  not  differentiable  in  general.  We  are  interested  in 
studying  if  it  gains  regularity  along  an  optimal  trajectory.  The  condition  we  derive 
in  the  sequel  can  therefore  be  seen  as  a  necessary  condition  for  optimality. 

There  are  several  reasons  suggesting  that  the  behaviour  of  V  should  be  better 
along  optimal  trajectories.  In  finite  dimensions  this  improvement  in  regularity  is 
known  for  problems  with  a  “strictly  convex  structure” ,  such  as  Calculus  of  Variations 
(see  [8]),  some  Minimum  Time  problems  (see  [4]),  and  problems  with  infinite  horizon 
(see  [10]).  Indeed,  in  such  examples,  the  value  function  V  is  differentiable  along  an 
optimal  trajectory. 

For  Mayer  problems,  however,  the  Hamiltonian  H  is  homogenuos  of  degree  one 
in  p,  and  so  it  is  not  strictly  convex.  Nevertheless,  by  a  careful  application  of  the 
Dynamic  Programming  equation  (1.4),  we  deduce  that  the  value  function  V  cannot  be 
too  singular  along  an  optimal  trajectory  x*{t),t  £  ( to,T ),  of  a  sufficiently  “smooth” 
problem.  In  fact,  we  prove  the  estimate 

(1.6)  dimD+V(t,  x*(t))  <  1,  Vt  £  (t0,T), 

which  bounds  the  dimension  of  the  superdifferential  of  V  along  x*(t).  This  result  is 
new  also  for  finite  dimensional  problems. 

Finally,  we  would  like  to  observe  that  (1.6)  does  not  hold  for  an  arbitrary  Mayer 
problem,  but  requires  a  smoothness  assumption  on  the  set  f(t,  x*(t),  U )  of  admissible 
velocities.  Indeed,  if  H  vanishes,  then  equation  (1.4)  becomes  too  weak  a  condition 
on  V  to  prevent  the  generation  of  higher  singularities.  We  discuss  this  phenomenon 
in  Example  3.5. 


2.  Preliminaries 

Let  X  be  a  real  Hilbert  space  and  U  a  complete  separable  metric  space.  Fix  T  >  0 
and  let  ( to,Xo )  £  [0,T]  x  X.  Consider  the  problem  of  minimizing  the  functional 

(2.1)  =  g(x(T;t0,x0,'y)) 

over  all  measurable  functions  7  :  [0, 00)  — *  U  (usually  called  controls).  Here  g  : 
X  — >  R  is  a  given  continuous  function  and  x(-;f0,2;o!7)  is  the  mild  solution  of  the 
semilinear  state  equation 


x'(t)  +  Ax(t)  +  f(t,  x(t),  7 (t))  =  0,  t  £  [t0,  T\ 
x(t0)  =  x0, 


that  is  the  solution  of  the  integral  equation 

(2.3)  x(t)  =  e~^~to^Ax0  -  [  e~^t~^Af(s,x(s),'y(s))ds 

Jto 

for  all  t  £  [to,T]. 
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In  the  above  Mayer  optimal  control  problem  we  impose  the  following  assumptions 
on  the  data: 

(i)  A:  D(A)  C  X  — >  X  is  self-adjoint  and  generates 
an  analytic  semigroup,  etA,t>  0; 

(ii)  f  :  [0,  T]  x  X  x  U  — >  X  is  continuous  and  such  that 

(2.4)  7)|  <  0,(1  +  |x|), 

\f{t,x,'y)-f(s,y,'y)\  <  C0  [|f  -  s|  +  \x  -  y\] 

for  some  Co  >  0  and  all  t,  s  £  [0,  T] ,  x,  y  £  X,  7  6  U\ 

(in)  g  is  Lipschitz  on  all  bounded  subsets  of  X. 

It  is  well  known  that,  under  the  above  assumptions,  problem  (2.3)  has  a  unique  mild 
solution  x(-)  £  C([t0,T]-X). 

Let  12  be  an  open  subset  of  X  and  tp  :  Q  — »  M. 

Definition  2.1.  For  any  fixed  xq  £  Q,  the  semi- differentials  of  ip  at  x$  are  defined  as 

and  called  super  and  subdifferential  of  (p  at  x0,  respectively  (see  [5]j. 

The  semi-differentials  D+<^(x0)  and  D~ip(x0)  are  both  non-empty  if  and  only  if  p 
is  Frechet  differentiable  at  x0.  In  this  case  we  have 

D+p(x  0)  =  D~p(x0)  =  (V^(x0)} 

where  Vp  denotes  the  gradient  of  p. 

Definition  2.2.  We  denote  by  D*p(x0 )  the  set  of  all  points  p  £  X  for  which  there 
exists  a  sequence  {xn}nejv  in  X  with  the  following  properties 

((i)  xn  converges  to  x0  as  n  —>  00 

(ii)  p  is  Frechet  differentiable  at  xn,Vn  £  N 
(Hi)  Vp(xn)  weakly  converges  to  p  as  n  — >  00 

If  p  is  Lipschitz  in  a  neighborhood  of  x0,  then  p  is  Frechet  diffentiable  on  a  dense 
subset  of  fi.  Consequently,  D*p(x 0)  f>. 

Let  now  Q  be  convex  and  set  Br(x 0)  =  {x  £  X  \  \x  —  Xq\  <  r}. 

Definition  2.3.  We  say  that  p  is  semi-concave  if  there  exists  a  function 

lo  :  [0,  +00)  x  [0,  +00)  — >  [0,  +00) 

(i)  u(r,  s)  <  cv(R:  S),  V0  <  r  <  R,  V0  <  s  <  S 
(ii)  limw(r,  s)  =  0,  Vr  >  0 

slO 
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and  such  that 


(2.6)  X(p{x)  +  (1  -  A )tp(y)  -  <p( Xx  +  (1  -  A )y)  <  A(1  -  A)|a:  -  y\u(r,  \x  -  y\) 

for  every  r  >  0,  A  G  [0, 1]  and  x,y  G  f2  fl  Br(0). 

The  superdifFerential  of  a  semi-concave  function  has  several  useful  properties,  some 
of  which  are  recalled  in  the  following 

Proposition  2.4.  If  <p  is  semi-concave  in  Br(x0)  for  some  r  >  0,  then 

(2.7)  D+ip(  xQ)  =  WD*tp(x  0) 

where  co  denotes  the  closed  convex  hull.  In  particular  D+(p(x0)  ±  <j>. 


Remark  2.5.  For  a  semi-concave  function  the  semidifferential  of  interest  is  the  su¬ 
perdifferential,  since  either  the  subdifferential  is  empty  or  it  coincides  with  the  su¬ 
perdifferential  and  the  function  is  differentiable. 

We  define  the  value  function  of  problem  (2.1)-(2.2)  as 


(2.8)  V(t0,x 0)  =  inf{<?(:r(T;  to,x0, 7))|  7  :  [t0,T\  ->  U  is  measurable  }. 

The  result  below  will  be  applied  in  the  next  section.  We  denote  by  DfV(t,  x)  the 
superdifferential  of  V(t,  •)  at  x.  The  proof  of  the  Proposition  below  is  given  in  [3]. 

Proposition  2.6.  Assume  (2.4)  and  let  (t0,x0)  G  [0,  T)  x  X.  Then,  for  all  a  G  [0, 1), 

DfV(to,x0)  C  D(Aa)  &  DfV(t0,x0)  C  D(Aa). 


Under  additional  assumptions  on  the  data,  the  value  function  V  is  semi-concave 
in  (t,x)  on  [0,T)  x  X.  More  precisely,  the  following  result  is  obtained  in  [3]. 

Theorem  2.7.  Assume  (2.4),  and  suppose  that  there  exists  a  G  (0, 1]  such  that,  for 
all  R>  0  and  for  some  constant  CR  >  0, 


r  (o 


(2.9) 

I 

(**) 

Then  for  any  r  > 


/(-,-,  7)  is  differentiable  and 
~  9(i^)(S’  y’7) 


<  CR{\x  -y\  +  \t-  s|)“ 


for  all  s,t  G  [0,T],x,y  G  Rj?(0),7  G  U\ 

9{x)  +  g(y )  -  2 g(^)  <  CR\x  -  y\1+a,  Va,  y  e  BR( 0). 

0  there  exists  Cr  >  0  such  that 


(2.10)  V{t1,x1)+V{t0,x0)-2v(^^,^^'Sj<qCr{\t1-t0\  +  \x1-x0\)1+a 


for  all  ti,f0  G  [0, T  —  1]  and  all  xi,x0  €  Br(0). 

We  conclude  this  section  with  few  remarks  about  the  Dynamic  Programming  equa¬ 
tion  (1.4)  of  a  Mayer  optimal  control  problem.  As  we  recalled  above,  (1.4)  is  satisfied 
in  a  suitable  generalized  sense,  as  V  is  in  general  not  differentiable  and  the  coefficient 
of  the  linear  term  (DxV(t,x),Ax)  is  defined  only  for  x  G  D(A).  However,  we  know 
from  Proposition  2.6  that  D+V{t, x)  C  D(Ae)  for  all  9  G  [0,1).  Moreover,  every 
trajectory  of  (2.2)  enters  the  fractional  domain  D(A1~e)  as  soon  as  t  >  to.  Therefore, 
all  the  terms  in  the  following  equality  are  well  defined. 
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Theorem  2.8.  Assume  (2.4),  (2.9).  Let  x*(-)  be  an  optimal  trajectory  of  problem 

(2.2) -(2.1)  and  6  G  (0, 1).  Then, 

(2.11)  -pt  +  (Aepx,A1~ex*(t))  +  H(t,x*(t),px)  =  0 

for  all  t  G  (t0,T)  and  all  ( pt,px )  £  D+V(t,x*(t)). 

The  above  result  is  all  we  need  to  know  about  equation  (1.4)  in  the  sequel.  For  its 
proof,  see  [3],  Theorem  5.2. 

3.  An  Optimality  Condition 

We  now  show  that  the  results  of  the  previous  section,  toghether  with  some  hy¬ 
potheses  on  the  control  set  and  on  the  dynamics  /  in  the  state  equation  (2.2),  can 
be  used  to  study  the  structure  of  the  singular  set  of  the  value  function  V  associated 
with  a  Mayer  optimal  control  problem. 

In  addition  to  the  Dynamic  Programming  equality  (2.11),  the  proof  of  our  main 
result  is  based  on  some  notions  of  convex  analysis.  These  results  are  adapted  from 
[4] ,  where  they  were  obtained  in  a  finite  dimensional  set-up. 

We  recall  that  the  support  function  to  a  convex  set  K  C  X  is  given  by 

=  sup(A :,p). 

k€K 

If  k  G  K,  the  normal  cone  to  K  at  k  is  the  set 

NK(k)  =  {P  €  X  :  {p,k-  k)  >  0  Vfc  G  K}. 

Moreover,  given  a  convex  set  K,  we  consider  the  smallest  affine  set  that  contains  K. 
This  set  is  called  the  affine  hull  of  K  and  is  denoted  by  aff(A).  The  relative  interior 
of  a  convex  set  K  is  defined  as  the  interior  which  results  when  K  is  regarded  as  a 
subset  of  its  affine  hull  aff(X). 

Lemma  3.1.  Let  Ki,  K2  C  X  be  bounded  closed  convex  sets  and  suppose  that  K2  is 
not  a  singleton.  Then  the  following  two  properties  are  equivalent: 

there  exists  k  G  K\  such  that  K2  C  Nk1  ( k )  ; 

(rKl(XPo  +  (1  -  A)pi)  =  \aKl(po)  +  (1  -  X)crKl{Pi) 

VpojPi  £  A2,  VA  G  [0, 1]. 

Proof.  From  the  definition  of  normal  cone 

P  G  NKl  (k)  «=>  aKl(p)  =  {p,k). 

This  yields  that  (i)  implies  (ii). 

Conversely,  let  us  suppose  that  (ii)  holds.  Let  p  be  a  point  in  the  relative  interior  of 
K2.  Since  K1  is  a  weakly  compact  set  and  Die  map  k  — >  (k,p)  is  weakly  continuous, 
there  exists  k  G  Ki  such  that  <7^  (p)  =  (jp,k). 

By  definition, 

(3.2) 


(3.1) 


(0 

(ii) 


VKrip)  >  {p,k),  Vp  G  K2. 
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Let  us  suppose  that  there  exists  p0  £  K2  such  that  crKl(p0)  >  (p0.  k).  Since  p  is 
in  the  relative  interior  of  K2,  there  exist  pi  £  K2,pi  ^  p,  and  A  £  (0, 1)  such  that 
p  =  \p0  +  (1  -  A)pi .  Then 

VKiipi)  =  (1  -  A)“V/o(p)  -  Au/f^po))  <  (1  -  A)_1(p  -  Apo,  k)  =  (pi,k), 

in  contraddiction  with  (3.2).  It  follows  that  (3.2)  holds  as  an  equality  for  every 
p  £  Ko-  Therefore  K2  C  Nxl  (k).  ■ 

We  denote  the  segment  from  a  point  a;  to  a  point  y  as 

[x,  y]  =  {Ax  +  (1  -  A)y|  A  G  [0, 1]}. 

Theorem  3.2.  Assume  (2.4),  (2.9)  and  let  x*(-)  be  an  optimal  trajectory  of  problem 

(2.2)  -(2.1)  such  that 

(3.3)  f(t,  x*(t),  U)  is  a  closed  convex  set  with  boundary  of  class  Cl 
for  any  t  £  ( to,T ).  Then, 

(3.4)  dimD+ V(i,  x*(t))  <  1 ,  Vf  £  (t0,  T). 

Moreover,  for  any  t  £  (to,T)  there  exists  p(t)  £  M  x  X  and  p(t)  £  [0, 1]  such  that 

D+V{t,x*(t))  =  [p(t)p(t),p(t)}. 

Proof.  Fix  t  £  (to,T).  Then  either  dimD+V(t,x*(t))  =  0  or  dimD+V(t,x*(t))  >  0. 
Clearly  if  the  first  case  holds  there  is  nothing  to  prove.  Let  us  consider  the  second 
case.  Assume  that  dimD+V(t,x*(t))  >  1  and  take  two  elements  ( pt,px )  and  (pt,p'x) 
in  D+V(t,x*(t)).  Consider  p*  =  Apx  +  (1- A)p!,,  A  £  [0, 1].  Then,  since  D+V(t,x*(t)) 
is  a  convex  set,  ( Pt,Px )  £  D+V(t,x*(t)).  Recalling  that 

H(t,X  (t),Px)  =  &f(t,x*(t),U)(Px)i 

we  evaluate  equation  (2.11)  at  (pt,Px)  to  obtain 

&  f(t,x*(t),u)  (Apx  +  (1  A  )px) 

pt  -  A9(Xpx  +  (1  -  A )px),Al~ex*{t)}  =  \(T f (t,x* (t) ,U) (.Px )  +  (1  -  A)<T/(t,x*(t),f7)(px). 

Therefore,  on  the  sections  of  D+V(t,x*(t))  at  the  level  pt,  0f(t,x*(t),u){')  is  a  linear 
function.  By  Lemma  3.1  and  assumption  (3.3),  we  derive  that  px  =  ppx  for  some 
p  >  0.  We  now  evaluate  equation  (2.11)  at  ( Pt,Px )  and  (p*,px)  to  obtain 

<7f{t,x*{t),u){Px)  +  (A6px,  A1  °x*(t)^  =  pt 

and  paf(t,x*(t),u){Px)+ p(A9px,Al~ex*(t))  =  pt. 

Hence, 

Vf(t,x-(t),u){Px)  +  (Aepx,  A1_V(t))  =  (t),u)  (j>x )  +  (Aepx,Al~ex*{t))\, 

which  yields  p  =  1.  Therefore,  for  any  pt  there  exists  at  most  one  px  such  that 
(Pt)Px)  G  D+V(t,x*(t)). 
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Let  p°(t),pl(t)  G  D+V(t,x*(t))  be  such  that 

Pt  =  min{pt|  3px  :  (px,pt)  G  D+V(t,x*(t))} 

and  p\  =  max{pt|  3px  :  (px,pt)  E  D+V(t,x*(t))}. 

By  the  convexity  of  the  set  D+V(t,x*(t)),  we  derive  that  there  are  no  points  px 
such  that  ( Pt,Px )  €  D+V(t,x*(t))  with  pt  <  p °  or  pt  >  p ].  On  the  other  hand,  for 
any  pt  such  that  p°  <  Pt  <  p\,  there  exists  at  least  a  px  so  that  (pt,px)  belongs  to 

D+V(t,x*(t)).  Moreover  px  is  unique  since  we  proved  before  that  on  any  section  of 

D+V(t,x*(t))  at  the  level  pt  there  is  only  one  px,  hence  dimD+V(t,x*(t ))  <  1.  This 
yields  = 

We  now  prove  the  last  statement  of  the  theorem.  Plugging  p°  =  pQ{t),pl  =  pl{t) 
and  px  =  A p°  +  (1  —  X)p1,  A  G  [0, 1]  in  equation  (2.11)  we  obtain 

-Pt  +  (Aep°x,  J41-V(t)}  +  o-/(t,x*(t),y)(p2)  =  0, 

~Pt  +  {AeP^Al~eX*{t))  +  (Tf  (t,x'{t),U){pl)  =  0, 

-Pt  +  ( a6Pxi  Al~ex*{t))  +  <Jj(t,X'(t),u){Px)  =  0. 

From  the  above  equations  we  derive 

c f(t,x*(t),U)(Px)  =  ^ af(t,x*(t),u)(p° )  +  (1  -  X)Vf(t,x*(t),U)(pD- 

Therefore,  crf(t,x*{t),u){‘)  is  linear  when  it  is  restricted  to  the  ortogonal  projection  of 
D+V(t,x*(t ))  on  X,  denoted  by  Ux{D+V(t,x*(t))).  Hence 

nx(T»+y(t,a;‘(t)))  =  [pitfp^pl], 

for  some  pit)  G  [0, 1],  which  yields  =  p{t)p\.  Again  from  equation  (2.11)  we  get 
p°t  —  p{t)p\  and  the  proof  is  concluded.  ■ 

Remark  3.3.  The  above  Theorem  holds  for  any  real  Hilbert  space  X  such  that 
dim  X  >  1.  However,  if  dim  X  =  1,  then  assumption  (3.3)  is  meaningless.  In 
this  case  we  only  require  that  f{t,x*{t),U)  ^  {0}  for  all  t  G  {t0,T),  in  order  to 
assure  that  crf(t,X’(t),u)  i1  0. 

Remark  3.4.  The  last  statement  of  the  above  Theorem  shows  that  D+V{t,x*{t)) 
must  be  a  radial  segment,  contained  in  a  half-line  starting  from  the  origin.  This  fact 
is  a  very  strict  requirement  and  suggests  that,  even  for  Mayer  problem,  the  value 
function  might  be  differentiable  along  optimal  trajectories  provided  that  f(t,  x,  U )  is 
sufficiently  smooth.  This  question  is,  however,  still  an  open  problem. 

The  above  argument  heavily  relies  on  assumption  (3.3),  that  allows  to  use  the 
Hamilton- Jacobi-Bellman  equation  (2.11)  to  study  D+V.  However,  such  a  condition 
is  also  necessary  for  an  estimate  like  (2.11)  to  hold  true,  as  we  now  show  with  the 
following  example. 

Example  3.5.  We  consider  the  state  equation  in  X  =  R2 

f  x’(s)  =  7i(s)x(s),  s  G  [t,T\,  x(t)  =  x 
\  y\s)  =  7 2{s)y{s),  s  G  [t,T\,  y{t )  =  y. 
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with  U  =  [0, 1]  x  [0, 1],  Notice  that  assumption  (3.3)  is  not  satisfied  on  xy  =  0.  The 
value  function  V  is  defined  as 

V(t,x,y )  =  inf{z(T)  +y(T)\  (71,72)  :  [t,T]  ->  U  measurable}. 

Then  is  not  difficult  to  see  that 

(x  +  y  x,y>  0 

eT~tx  +  y  x  <  0,  y  >  0 

x  +  eT~ty  x  >  0,  y  <  0 

eT_t(x  +  y)  x,y  <  0 

We  note  that  the  (unique)  optimal  trajectory  at  (0, 0)  is  given  by  x(t)  =  0,  y(t)  = 
0,  t  €  [0,T].  Moreover,  by  Proposition  2.4,  we  have 

D+V{t,  0, 0)  =  co  {(1, 1, 0),  (eT-t,  1, 0),  (1,  eT~‘,  0),  (< zT~\  eT~\  0)}, 

and  so  dim  D+V(t,  0,0)  =  2  for  all  t  S  [0,  T). 
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Abstract.  We  consider  optimal  control  problems  governed  by  semilinear  elliptic  equations 
with  pointwise  state  constraints.  The  set  of  control  constraints  is  a  subset  of  a  ZALebesgue 
space  with  1  <  p  <  oo,  it  is  described  by  pointwise  and  integral  constraints.  We  prove  a 
Pontryagin  principle  in  integral  form. 
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1.  Introduction 

During  the  ten  last  years,  many  papers  have  been  devoted  to  Pontryagin  principles 
for  control  problems  governed  by  partial  differential  equations  (see  [18],  [6]).  In  the 
presence  of  state  constraints,  Pontryagin’s  principles  are  often  proved  thanks  to  the 
Ekeland’s  variational  principle  (applied  to  problems  in  which  state  constraints  are 
penalized),  coupled  with  methods  of  spike  perturbations.  To  apply  the  Ekeland’s 
variational  principle,  the  space  of  controls  Va(j  endowed  with  the  so-called  Ekeland 
distance,  must  be  complete.  This  assumption  of  completeness  is  in  general  not  satis¬ 
fied  if  Vad  is  a  subset  of  a  IT-Lebesgue  space  with  1  <  p  <  oo  (see  [14]).  This  is  the 
reason  why  most  applications  of  that  method  deal  with  bounded  controls. 
Extensions  to  problems  with  unbounded  controls  are  considered  in  [14],  [12],  [19].  The 
idea  is  to  use  perturbations  whose  difference  with  an  optimal  solution  is  bounded  in 
a  IT-space  with  1  <  p  <  oo.  Extensions  of  [14],  [12]  are  given  in  [11],  The  method 
in  [19]  has  been  improved  in  [20]  by  considering  perturbations  whose  difference  with 
an  optimal  solution  is  bounded  in  L°°. 

In  this  paper  we  want  to  show  that  the  method  developed  by  Li-Yong  [17],  [18]  and 
by  Casas  [6]  (first  introduced  by  Li-Yao  [16]),  and  extended  to  unbounded  controls  in 
[19],  [20],  can  also  be  extended  to  problems  with  integral  control  constraints  (see  the 
beginning  of  Section  3.2).  Let  us  explain  what  is  new.  In  [11],  [12],  [20],  the  sets  of 
admissible  controls  are  patch  complete  in  the  sense  introduced  by  H.  O.  Fattorini  [12] 
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(that  is,  a  perturbation  of  an  admissible  control,  over  a  subset  of  small  measure,  by 
an  other  admissible  control,  is  still  admissible).  Notice  that  constraints  of  the  form 
\\V\\LP  <Cdo  not  correspond  to  a  patch  complete  subset  of  LP  if  1  <  p  <  oo.  Here 
the  novelty  is  that  we  still  obtain  a  Pontryagin’s  principle,  even  in  cases  when  the 
set  of  admissible  controls  is  not  patch  complete.  When  the  set  of  admissible  controls 
is  not  patch  complete,  we  cannot  use,  as  in  [20],  perturbations  whose  difference  with 
an  optimal  solution  is  bounded.  This  is  the  reason  of  another  difficulty  because 
there  is  a  gap  between  convergence  properties  implied  by  the  Ekelend  variational 
principle  (Lemma  3.1)  and  convergence  properties  necessary  to  prove  some  continuity 
properties  of  the  cost  functional  (see  [6],  Theorem  5.1  and  [20],  Lemma  3.1).  Contrary 
to  [6],  [20],  here  we  only  prove  a  lower  semicontinuity  property  (Proposition  3.1,  ii) 
under  some  convexity  condition  of  the  cost  functional  with  respect  to  the  control 
variable  (observe  that  this  convexity  condition  is  not  needed  in  [6],  [20]). 

For  simplicity,  we  present  this  extension  for  problems  governed  by  semilinear  elliptic 
equations,  but  the  same  kind  of  results  can  be  obtained  for  problems  governed  by 
semilinear  parabolic  equations  as  those  considered  in  [6],  [13],  [19],  [20]. 


2.  Setting  of  the  Problem 


Let  fl  be  an  open  bounded  subset  of  RN  ( N  >  2)  with  a  Lipschitz  boundary  T,  q, 
r  and  f  denote  positive  numbers  satisfying 

N  , 

g>y,  r  >  r  >  N  —  1 

We  consider  a  second  order  differential  operator  defined  by: 

N 

Ay  =  —  'y  Di(dij(x)Djy), 
i,j= 1 

(Di  denotes  the  partial  derivative  with  respect  to  xt)  with  coefficients  Oy  belonging 
to  L°°(Q)  and  satisfying  for  some  m0  >  0 


^2  aij(x)£i(,j  >  m0 1£|2  for  all  (el1''  and  a.e.  x  ett. 
i,j= 1 

We  consider  the  following  boundary  value  problem: 


(2.1) 


Ay  +  $(x,y)  =  0  in  O, 


dv 

-2-  +  ^>{s,y,v)  =  0  on  T. 
dnA 


The  function  $  (resp.  \k)  is  a  Caratheodory  function  from  Q  x  R  (resp.  from  T  x  M2) 
into  R.  For  almost  every  ie!l,  (resp.  almost  every  s  €  T  and  every  v  G  R),  $(a:,  •) 
(resp.  \I >(s,  •,?;))  is  of  class  C 1  and  we  have  the  following  estimates 


|$(a:,0)|  <  Mr  (a:),  0  <  a0  <  &y(x,y)  <  M^x^yl), 


|^(s,  0,  v)\  <  M2{s )  +  0  <  b0  <  ^(s,  y,  v)  <  (M2(s)  +  mi|w|)r/(|y|), 

where  Mi  G  Lq{Q),  M2  £  Lr(T),  m-y  is  a  positive  constant,  and  rj  is  a  nondecreasing 
function  from  R+  into  R+  (we  have  denoted  by  &y  the  partial  derivative  of  <E>  with 
respect  to  y,  we  adopt  in  all  the  sequel  the  same  kind  of  notation  for  other  functions). 
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In  addition,  we  assume  that  the  pair  ( a0,b0 )  satisfies  the  ellipticity  condition  (Em) 
stated  below. 


We  say  that  a  pair  of  nonnegative  functions  ( a,b )  E  Lg(Q)  x  Lr(r)  satisfies  the 
ellipticity  condition  (Em)  if: 


(Em)  inf{  ^ 1  [J^OijDjyDiy  +  ay2)  dx  +  by 2  ds]  \  y  E  H1^)}  >m>  0. 


Remark  2.1.  Since  a  and  b  are  nonnegative  functions,  if  fi  is  connected,  the  ellipticity 
condition  (Em)  is  satisfied  for  some  m  >  0  if  and  only  if  (a,  b)  is  not  identically  zero. 

Remark  2.2.  For  every  v  E  Lr(T),  equation  (2.1)  admits  a  unique  weak  solution  in 
H *(17)  fl  C(Cl)  (see  Section  2). 


We  consider  the  following  optimal  control  problem: 

(P)  inf {J(y,v)  |  (y,v)  E  ( H 1(Q)  nC(Q))  x  V,  (y,v)  satisfies  (2. l)-(2.2) }, 

where 

J(y,v)=  I  F(x,y(x))dx  +  /  G(s,y(s),v{s))ds, 

Jn  Jr 

(2.2)  f{x,  y(x))  <  0  for  every  x  E  Cl, 

V  =  {v  E  LT(T)  |  v(s)  E  K(s),  j  g{s,v{s))ds  =  0,  J^h(s,v(s))  ds  <  0}, 

where  K  is  a  measurable  multimapping  from  T  with  closed  and  nonempty  values  into 
P(M). 

We  suppose  in  the  sequel  that  F  (resp.  G )  is  a  Caratheodory  function  from  Qxl 
(resp.  from  T  x  R2)  into  R.  For  almost  every  x  E  fl,  (resp.  almost  every  s  E  T  and 
every  v  E  M),  F(x,  •)  (resp.  G(s,  ■ ,v ))  is  of  class  C 1  and  we  have 

|P(:r,0)|  <  M3(x),  \F'y(x,y)\  <  M3(x)r](\y\) , 

0  <  G(s,  0,  v)  <  Mi(s)  +  m1\v\r ,  ]G’y(s,y,v)\  <  (M4(s)  +  TOX|i;|r)7/(|j/|), 

where  M3  E  Z/1  (F2),  M4  E  ^(T),  mj  and  rj  are  the  same  as  before.  For  almost 
every  s  E  T  and  every  y  E  M,  G(s,y,  •)  is  convex  on  R.  The  functions  g  and  h  are 
Caratheodory  functions  from  Txl  into  K  satisfying 

|<?(s,u)[  <  Mi{s)  +  mi|u|r,  — M4(s)  —  mi|u|r  <  h(s,v)  <  M4(s)  +  mi|u|r, 

mi  and  M4  are  the  same  as  before.  We  also  suppose  that  the  function  /  :  Cl  x  M  — >  M 
is  continuous  and  that  for  every  x  E  Cl,  f(x,  •)  is  of  class  C1. 

The  main  result  is  the  following  Pontryagin  principle. 

Theorem  2.1.  If  (y,v)  is  a  solution  of  (P),  there  then  exist  (v,  jl)  E  R+  x  M(Cl)  and 
p  belonging  to  W1,T(f2)  for  all  1  <  r  <  N/(N  —  1)  such  that 

(2.3)  (h,  p)  0,  p,  >  0,  {ft,  f(-,y(-)))M(si)xC( a)  =  0. 
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(2.4) 


r  A*p  +  <S>'y{x, y)p  =  vF^x, y)  +  pnfy(x, y)  inti, 
1  9^7  +  %(s,y,v)p  =  vG'y{s,y,v)  +  firffay)  inT, 


(2.5)  J  H(s,y(s),v(s),p(s),v)ds  <  J^H(s,y(s),v(s),p(s),v)ds 

for  every  v  G  V,  where  H(s,y,v,p,v )  =  vG(s,y,v)  +  p$(s,y,v),  A*p  = 
-  Ef)=i  Dj(aij(x)Dtp),  pa  is  the  restriction  of  ft  to  Q  and  jSr  is  the  restriction  of  ft 
to  r. 


3.  Technical  results 

3.1.  State  equation.  Adjoint  equation. 

Theorem  3.1.  For  every  v  G  Lf(T),  there  exists  a  unique  weak  solution  yv  G  H1^)  fl 
L°°(Q)  of  (2.1).  This  solution  belongs  to  C(Q)  and  we  have 

Ibulli/qn)  +  l|y«llc(n)  <  GA{1  +  ||v||z,f(r)), 

where  CA  depends  on  f  but  does  not  depend  on  v.  Moreover,  the  mapping  v  t — >  yv  is 
continuous  from  Lr(T)  into  C(Q). 

Proof.  See  [1]. 

Let  (a,  b)  G  Lq(fl)  x  Lr(V)  be  a  pair  of  nonnegative  functions  satisfying  (Em).  We 
consider  the  following  boundary  value  problem 

dp 

(3.1)  A*p  +  ap  =  pn  in  Q,  +  bp  =  pT  in  I\ 

where  p  =  pa  +  pr  is  a  Radon  measure  on  Cl.  We  shall  say  that  p  G  W1,1^)  is 
solution  of  (3.1)  if  ( ap,bp )  G  L1(f2)  x  L1(r),  and  if  : 

J  (]T  aijDipDjip  +  aptp)  dx  +  Jr  bTP  ds  =  (/*>  t)m(Q)xC(Ci) 
for  every  <p  G  W1,0°(f2).  Following  [1],  we  have 

Theorem  3.2.  For  every  pair  of  nonnegative  functions  (a,  b)  G  Lq(fl)  X  Lr(T)  satis¬ 
fying  (Em)  and  every  p  G  M{Cl)  there  exists  a  unique  solution  p  G  VF1,:l(fl)  of  (3.1) 
satisfying 

p(Ay  +  ay)  dx  +  ^pIq^  +  bv) ds  =  y)x(fi)xC(fl) 

for  every  y  G  {y  G  H' (fl)  |  Ay  G  Lq( Q),  ^  G  Lr(r)}.  Moreover,  this  solution 
belongs  to  W1'T( fl)  for  every  1  <  r  <  N/(N  -  1),  and  there  exists  a  positive  constant 
C5  =  C5(t),  not  depending  on  a,  •&  and  p,  such  that 

Ibllw^(fi)  <  Cl5i|p,||A^(n). 
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Proof.  The  proof  is  given  in  [1],  Theorem  4.2.  It  is  well  known  that  (3.1)  can  admit 
more  than  one  solution  [21].  However,  there  is  a  unique  one  satisfying  the  Green 
formula  stated  in  Theorem  3.2.  □ 


We  also  need  in  Section  3.3  the  following  regularity  results. 


Theorem  3.3.  For  every  pair  of  nonnegative  functions  (a,  b )  E  Lq(Q)  x  Lr(T)  sat¬ 
isfying  ( Em )  and  every  <fi  E  L9(Q.),  ip  E  Lr(T),  there  exists  a  unique  weak  solution 
y  E  if1(Q)  of  the  equation: 


Ay  +  ay  =  <p  in  G, 


-p-  +  by  =  ip  onT, 
OUa 


this  solution  belongs  to  Ca(fl)  for  some  0  <  a  <  1  and  we  have  the  following  estimates 


l|y[|c(n)  <  C6(||^||w(fi)  +  HV'lli.’-(r)), 

IMIc“(n)  <  CVGI^IIx/qn)  +  IIV'IUtcX1  +  IHU«(n)  +  IHUr(r)), 

where  Cg  and  C-j  are  independent  of  (p,  ip,  a  and  b. 


Proof.  The  first  part  is  proved  in  [1].  Thanks  to  the  first  estimate,  the  second  one 
can  be  proved  with  regularity  results  in  [15]. 


3.2.  Metric  space  of  controls.  In  methods  developed  in  [4],  [6],  [18]  to  prove  a  Pon- 
tryagin  principle  the  set  of  admissible  controls  Vad  is  endowed  with  the  so-called 
Ekeland  distance  dE-  To  apply  the  Ekeland  variational  principle,  the  space  ( Vad,dE ) 
must  be  complete  and  the  mapping  v  i — >  yv  must  be  continuous  from  this  metric 
space  to  C(f!!).  Approximate  optimal  solutions  are  then  characterized  by  an  ap¬ 
proximate  Pontryagin’s  principle  thanks  to  some  method  of  perturbation.  When  the 
control  set  is  defined  by 

Vad  =  {v  E  L°°(T)  |  v(s)  E  K  for  a.e.  s6P) 

(where  K  is  a  compact  subset  in  M),  the  above  conditions  are  satisfied  and  the 
method  of  spike  perturbations  can  be  used  to  recover  a  Pontryagin’s  principle  [4]. 
In  our  case,  (V,  djf)  is  not  complete,  the  mapping  v  \ — >  yv  is  not  continuous  from 
(V,dE)  into  (7(0),  and  the  method  of  spike  perturbations  cannot  be  used  because  a 
spike  perturbation  of  an  admissible  control  is  not  necessarily  admissible. 

An  other  kind  of  perturbation,  first  introduced  by  Li-Yao  [16],  has  been  developed 
by  Li-Yong  [17],  [18],  and  by  Casas  [6].  This  kind  of  perturbation,  called  diffuse 
perturbation,  has  been  adapted  in  [20]  to  treat  problems  with  unbounded  controls. 
The  idea  is  to  use  diffuse  perturbations  whose  difference  with  an  optimal  solution  is 
bounded  in  L°°(T).  Here,  by  using  (as  in  [19])  diffuse  perturbations  whose  difference 
with  an  optimal  solution  is  bounded  in  Lr[T),  we  can  consider  problems  with  integral 
control  constraints. 


We  define  a  new  metric  space  in  the  following  way.  Let  v  be  in  V  (v  will  be  an 
optimal  boundary  control  that  we  want  to  characterize).  For  0  <  M  <  oo,  let  us  set: 

V(M)  =  {u  e  V  |  ||v  -  u||tr(r)  <  M}, 


and  define: 


dE(v i,v2)  =  CN  x({s  G  T  |  iq(s)  ^  v2(s)}). 
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Lemma  3.1.  Let  M  >  0  and  {(un)nW}  C  V(M).  If  (vn)n  tends  to  v  in  ( V(M),dE ), 
then  ( vn)n  tends  to  v  in  Lr(T). 

Proof.  The  proof  is  immediate  if  we  remark  that,  since  1  <  r  <  r,  we  have 

J^\v-vn\rds  <  ||n-u„||ir(r)(d£;(n„,n))~  <  (2 M)T  (dE{vn,v))~ .  □ 

Proposition  3.1.  For  every  M  >  0,  we  have  : 

i)  (V(M),dE)  is  a  complete  metric  space, 

ii)  the  mapping  which  associates  yv  with  v  is  continuous  from  ( V(M),dE )  into 

C$1), 

ii)  the  mapping  which  associates  J(yv,v)  with  v  is  lower  semicontinuous  from 
(V(M),  dE)  into  R. 

Proof. 

i)  Let  (vn)n  be  a  Cauchy  sequence  in  (V(M),dE).  Following  [9],  we  can  prove 
that  (vn)n  converges  for  the  distance  dE  to  some  measurable  function  v  such  that 
v(s)  e  K(s )  for  a.e.  s  e  T.  As  in  the  proof  of  Lemma  3.1,  we  can  prove  that  (vn)n  is 
a  Cauchy  sequence  in  Lr(T).  Moreover,  (vn)n  is  bounded  in  17(r).  Therefore  (vn)n 
converges  to  v  strongly  in  Lr( F)  and  weakly  in  Lr(T).  On  the  other  hand,  by  using 
Fatou’s  Lemma  (applied  to  the  sequence  of  functions  h(-,  v„(-))  +  M4(-)  +mi|un(-)|r’), 
it  yields  : 

J  h(s,  v)  ds  <  lirrnnf  J  h(s,  vn)  ds  <  0, 

and  we  also  have 

||u  -  v||i,r(r)  <  lirnmf  \\vn  -  u||Lr(r)  <  M. 

Moreover,  we  have  lim„  fr  g(s ,  vn)  ds  =  0  and  (because  (vn)n  converges  to  v  in 
Lf{T))  there  exists  a  subsequence,  still  indexed  by  n,  such  that  limn  /r  g(s,  vn)  ds  = 
fr  g(s,  v )  ds.  Therefore,  fr  g(s,  v)  ds  =  0  and  v  belongs  to  V (M). 

ii)  This  assertion  follows  from  Lemma  3.1  and  from  the  continuity  result  of  Theorem 
3.1. 

iii)  We  consider  {{vn)n,v}  C  V(M),  such  that  (vn)n  converges  to  v  for  the  metrics 

dE-  Recall  (see  (i))  that  (vn)n  also  converges  to  v  for  the  weak  topology  of  Lr(T).  We 
complete  the  proof  thanks  to  the  assumptions  on  F,  G  and  thanks  to  the  continuity 
results  stated  in  (ii).  (In  particular,  we  use  the  convexity  of  G(s,y,  •)  and  estimates 
on  G  to  prove  the  lower  semicontinuity  of  v  \ — >  J{yv,  «)■)  □ 

3.3.  Diffuse  perturbations. 

Lemma  3.2.  Let  u1;  v2  and  v3  be  in  V  and  let  iq  be  in  C(Q).  For  every  0  <  p  <  1, 
there  exists  a  sequence  of  measurable  subsets  {Ep)n  in  T  such  that 

(3.2)  cn-\e;)  =  PcN~\ r), 

(3.3)  /  h(s,vi)ds+  [  h(s,  v2)  ds  =  (1  —  p)  [  h(s,  uj)  ds  +  p  h(s,  vf)  ds, 

Jr\Ez  JEp  Jr  Jr 

(3.4)  /  g(s,  vf)  ds  +  f  g(s,  v2)  ds  =  (1  -  p)  I  g(s,  ?q)  ds  +  p  g(s,  v2)  ds, 

Jr\E™  Je%  Jr  Jr 
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—  V-i\r  ds , 


measur- 


(3.5)  [  bi  - -i^rds  +  [  |^2  — 

Jr\E£  Je% 

=  (1  ~  P)  J  K  -  v3\r  ds  +  p  J^\v2  -v3\r  ds, 

(3.6)  [  (G(s,y1,v2)-G(s,y1,v1))ds  =  p  [{G(s,yuv2)-G(s,yi,vi))ds, 

Je"  Jr 

(3.7)  -XB"  — >■  1  weakly-star  in  L°°(r)  when  n  tends  to  infinity, 

p  p 

where  xe*  is  the  characteristic  function  of  E™. 

Proof.  This  lemma  is  an  easy  consequence  of  the  Lyapunov  convexity  Theorem  (see 
[20]). 

Theorem  3.4.  Let  vi,  v2  and  v3  be  in  V.  For  every  0  <  p  <  1,  there  exists  a  measur¬ 
able  subset  Ep  C  T  such  that 

(3.8)  cn-\ep)  =  pCN~\ r), 

(3.9)  /  h(s,vi)ds+  f  h(s,v2)ds  =  (1  -  p)  [  h(s,vi)ds  +  p  [  h(s,v2)ds, 

Jr\Ep  JEf,  Jt  Jt 

(3.10)  g(s,Vi)ds+JE  g(s,  v2)  ds  =  {l  -  p)  g(s,  Vi)  ds+p  g(s,  v2)  ds, 

(3.11)  [  \vx  -  v3\r  ds  +  (  \v2-v3\rds 

Jr\Ep  J  Ep 

=  (1  -  p)  J  \vi  -  v3\r  ds  +  p  J^\v2  -v3\Tds, 

(3.12)  Je  ( G(s ,  yi,v2)  -  G(s,  yi,vi))  ds  =  p  Jr{G(s,  yi,v2)  -  G{s,  yu  vj)  ds, 

(3.13)  yP  =  Vi+pz  +  rp,  1™  ^INIc(n)  =  0, 

(3.14)  J(yP,vp)  =  J(yi,vi)  +  pAJ  +  o(p), 
where 

(\_jv i(s)  '/s€r\^ 

Vp  S  \v2(s)  if  s  £  Ep, 

yp  and  y3  are  the  weak  solutions  of  (2.1)  corresponding  respectively  to  vp  and  to  v3, 
z  is  the  weak  solution  in  7T1  (T2)  of 

dz 

Az+$'y(x,y1)z  =  0  inil,  - — +%{s,yi,vi)z+^(s,y1,v2)-^(s,y1,v1)  =  0  inT 


A  J  = 


J  (G'y&y^vjz  +  G(s,yi,v2)  -  G(s,yuvi))  ds. 


and 
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Proof.  Let  (Ep)n  be  the  sequence  of  measurable  subsets  defined  in  Lemma  4.1.  We 


set 


U(s)  if  S  e  r  \ 

I  M*) 


if  s  e  Enp. 


Let  ynp  be  the  solution  of  (2.1)  corresponding  to  vnp  and  let  2  be  the  function  defined 
in  the  statement  of  Theorem  3.4.  It  is  clear  that  C”  =  {ynp  -  Vi)/p  -  z  is  the  weak 
solution  in  of 

d( 


M  +  <C  =  4>nP  in 


On  A 


+  K{  =  rp+Vnp  on  T, 


where  a™  =  &y(x,yi  +  t{ynp  -  yi))dt,  bnp  =  /q1  T;(s,?/i  +  t{ynp  -  Vx),vnp)dt ,  fnp  = 
($(,(z,  yx)-afjz ,  ipnp  =  (^(s,  yi,vx )~bnp)z,  ynp  =  (l-ixB?)(^(s,  yu  u2)-T(s,  yu  vx)) 
and  xe »  is  the  characteristic  function  of  TJ".  We  denote  by  C”1  the  solution  in  H1^) 
of 


AC  +  n"C  =  4>np  in  ft, 

by  (f  the  solution  in  iLx(Q)  of 


5C 


+  bnX  =  C  on  r, 


AC,  +  n"C  =  0  in  ft, 
and  by  the  solution  in  22 1(fl)  of 

A£  +  a£  =  0  in  ft, 
where  a  =  $(,(:£,  j/i),  6  =  $(,(s,  t/i,  Ui).  We  also  have 


^ <  on  r, 


A(Cpn-Cpn2)+a”(C;-Cf)  =  (n”-n)C  in  ft, 


0(.£p  Cp  )  ,  un/tn  cn2\_(hn_ 


dnA 


+bnp(Cp-Cf)  =  (bnp-b)^onT. 


Notice  that  (a,  b)  and  (ap,bp)  satisfy  the  ellipticity  condition  (Em).  Thanks  to  The¬ 
orem  3.3,  we  have 


(3.15) 


iiefihn)  <  c6(\\rp\\ Lw  +  \\rp\M, 


(3.i6)  iic;2  -  e;iic(n)  <  Os(K  -  +  \K  -  &lU'-(r))iicil^)- 

The  operator  T  which  associates  £,  the  solution  in  H1  (ft)  of 

A£  +  a4  =  0>  Jr-  +  b£  =  <p, 

oriA 

with  </?,  is  continuous  from  27 (T)  into  Ca(Cl )  (see  Theorem  3.3).  Since  the  embedding 
from  (7“  (ft)  into  C(Cl)  is  compact,  the  operator  T  is  compact  from  Lr(T)  into  C(fl). 
Because  of  (3.7),  for  every  0  <  p  <  1,  the  sequence  {tp”)„  converges  to  zero  for 
the  weak  topology  in  2/(T),  therefore  the  sequence  (£;)„  converges  to  zero  in  C(ft). 
There  then  exists  an  integer  depending  on  p,  denoted  by  n(p),  such  that 

IIC(p)lho)  <  p- 
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Since  we  have 

Jr  K(p)  -  uilr  ds  =  JE^  l^2  -  v'\r  ds - 

(Vp^)p  converges  to  Vi  in  Z7(T)  and  (f/”(p))p  converges  to  y\  in  C'(Q).  Prom  assump¬ 
tions  on  $  and  we  deduce  that,  (<^(p))p,  (a”(p)  -  a)p  both  converge  to  zero  in 
Lq(£l)  and  (tpp^) P,  (h"^  —  b)p  both  converge  to  zero  in  Lr(F)  when  p  tends  to  zero. 
Thus,  thanks  to  (3.15)  and  to  (3.16),  we  get 

Km  |IC»llc(ii)  <  lim  lli^'llc®  +  Km  ||(jM  -  Q»\\cm  +  Bm  ||«~<-> ||c(fi)  =  0. 

Now  we  set  Ep  =  E^p\  -prp  =  ("(p)  and  vp  =  u"(p),  conditions  (3.8)  to  (3.13)  are 
clearly  satisfied.  Moreover,  taking  into  account  (3.12),  (3.13)  and  the  definition  of  vp 
we  easily  verify  (3.14).  □ 

Remark.  Thanks  to  (3.9)  and  to  (3.10),  we  conclude  that,  even  if  V  is  not  patch 
complete  (in  the  sense  given  in  Introduction),  for  every  vi,v2  G  V,  we  can  construct 
a  diffuse  perturbation  (vp)p,  of  v\  by  v2,  such  that  vp  still  belongs  to  V.  By  setting 
v2  =  v  in  (3.11),  we  can  see  that  for  M  >  0,  if  vi,v2  G  V(M)  then  vp  belongs  to 
V(M),  for  every  0  <  p  <  1. 

4.  Proof  of  the  Pontryagin  principle 

4.1.  Penalized  problem.  Let  |  •  |C(q)  be  a  norm  on  C(Q).  equivalent  to  the  usual  norm 
II  '  llc(n)>  such  that  (C(Cl),  \  ■  | )  be  strictly  convex  and  M(Cl),  endowed  with  the 
dual  norm  of  |  •  |c(n)  (denoted  by  |  ■  ^(h)),  be  also  strictly  convex  (see  [8],  Corollary 
2  p.  148,  or  Corollary  2  p.  167).  We  have 

(4.1)  limaupl(y'  +  ^rlciai~l<y,>T(il> 

pXO,  p 

=  max{(£, z)M(fy xC(Q)  |  £  G  5|(-)+lc(h)(<y3)} 

for  every  ip,  z  €  C'(ST) ,  where  9|(-)+lc(n)  is  the  subdifferential,  of  |(-)+lc(h)>  in  the 
sense  of  convex  analysis  [7]  and  (-)+  =  max(-,  0).  Therefore,  for  a  given  p  G  C(Q)  we 
have 

(4.2)  (C)-2-^)^(n)xc(n)+l¥,+lc'(o)^l2:+lc(o)'7'  £  e  <9|(-)+lc(n) (ip),  V  z  G  (7(12), 

l£l.M(fi)  <  1  for  every  £  G  <9|(-)+lc(fi)(^)- 

Moreover  it  is  proved  in  ([17],  Lemma  3.4)  that,  since  <9| (-)+ lc(n) (w)  is  convex  in 
A4(n)  and  (AT(Sl),  |  •  |a^(S))  is  strictly  convex,  then  if  <p+  ±  0,  9|(-)+lc(h)(7’)  is  a 
singleton  and  |(-)+lc(S)  is  Gateaux-differentiable  at  tp. 

Let  (y,  v)  be  a  solution  of  ( P ).  We  consider  the  penalized  functional 

uv,v)  =  {[(j(y,v)  -  j(y,  v) + + m-M-miiti)}* 

and,  for  every  n  >  0,  we  define  the  penalized  problem 

(Pn)  inf {Jn(y,v)  |  (y,v)  G  C(Cl)  x  V{Mn),  {y,v)  satisfies  (2.1)}, 
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where  Mn  =  n^~^.  It  is  easy  to  see  that  for  every  n  >  0,  (y,v)  is  ^-solution  of 
(Pn).  Thanks  to  the  Ekeland’s  principle,  there  exists  vn  £  V(Mn)  such  that 

(4.3)  dj^[yn^v)  ^  ,  Jniyni^-n)  ^  dni.Vv'i'd)  4  d]q;{vn^v) 

71  71 

for  every  v  £  V{Mn )  (yn  and  yv  are  the  solutions  of  (2.1)  corresponding  respectively 
to  vn  and  to  v). 

4.2.  Proof  of  Theorem  2.1. 

1.  Approximate  optimality  conditions  for  the  boundary  control  vn  satisfying  (4.3). 

Let  v0  be  in  V.  For  n0  large  enough,  v0  £  V(Mn )  for  every  n  >  no-  Apply¬ 
ing  Theorem  3.4,  we  deduce  the  existence  of  measurable  subsets  Ep,  such  that 
C"-\E>)  =  PCN-\Y ), 

(4.4)  [  h(s,vn)ds  +  I  h(s,v0)ds  =  (l-p)  /  h(s,vn)ds  +  p  h(s,v0)ds , 

Jr\EZ  Je$ ;  Jr  Jr 

(4.5)  [  g(s,vn)ds+  f  g(s,v0)ds  =  {1  -  p)  f  g(s,vn)ds  + p  g{s,v0)ds, 

jt\e&  Je%  Jr  Jr 

(4'6)  A«K""r<,S  +  /Bh_Sr* 

=  (1  -  p)  J  \vn  -v\r  ds  +  p  I  Vo  -  v\r  ds, 

(4.7)  ypn  =  y„  +  pzn  +  rp,  lim  ^|K||c(n)  =  0, 

(4.8)  J(yp,  <)  =  J(yn,  vn)  +  pAJn  +  o(p), 
where  vp  is  defined  by 


€(s)  = 


vn(s)  on  T  \  Ep, 
V0(s)  on  Ep, 


ypn  and  yn  are  the  state  variables  corresponding  respectively  to  v p  and  to  vn,  zn  is  the 
weak  solution  of 

Qz 

A.zn  +  yn)zn  =  0  in  O,  ^  2/n>  VnjZn  ~  ^n)  Uni  ^o)  on  T, 

and 

A  Jn  =  J C  Fy(x,  y„{x))zn{ x)  dx  +  Jr  G’y(s,  yn{s),vn{s))zn(s)  ds 

+  J  [G(s,  y„(s),  v0{s))  -  G{s,  yn(s),  v„(s))]  ds. 

On  the  other  hand,  thanks  to  (4.4),  (4.5)  and  (4.6),  for  every  n  >  n0  and  every 
0  <  p  <  1,  vp  belongs  to  V(Mn).  If  we  set  v  =  vp  in  (4.3),  it  yields 

JnjynjVn)  ~  JuiVn-^’n)  j^£iV-l/p\ 
p— >0  p  ~  n 


(4.10) 
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Taking  (4.1),  (4.8)  and  the  definition  of  Jn  into  account,  we  get 

(4.11)  Vn AJn  (/hi*  fy{yn)^n) (■ 
where 

I  ./  I  7/„  .  i;„  i  —  »/  I  II  .  iv  i  t  - 

(4.12) 


(J(yn,vn)  -  J(y,v)  +  ^) 


d-niVn ;  ^n) 


(4.13) 


Mn 


'(/fa"))+'v£|lnr))+'C(m  if  U{Vn))+  ^  0, 

0  if  not. 


For  every  n  >  n0,  we  consider  pn  the  unique  weak  solution  of 

(  -d  Pn  T  y{% i  ynjVn  ~  PnFy{x lUn)  T  pn^fyi'^Vn)  ^ 

(4.14) 

v  dn’l*  4“  ^  y  ^  '  Vn '  '^n)  Pn  =  Gy  ( S >  Un :  Vn )  T  pnT  fy[‘  iVn)  Oa 
which  satisfies  the  Green  formula  : 

(4.15)  jT  pn(Ay  +  $(,(:r,  y„)y)  dx  +  ^  Pn(J^  +  ^y(s’  ds 

=  vnFy(x,  yn)y  dx  +  ^  vnG'y(s,  yn,  vn)y  ds  +  (y„,  /' (•,  y„)y}.M(h)xC(fi)> 

for  every  y  G  (y  G  tf^Q)  |  Ay  €  Lg(Q),  G  Lr(r )}  (ynfi  is  the  restriction  of  pn 
to  fl  and  y„r  is  the  restriction  of  pn  to  T).  With  this  Green  formula,  with  (4.11)  and 
the  definition  of  AJn,  we  get 

(4.16)  J  [PnG(syyn,vn')  yn^(s,  yn,  un)]  ds 

<  [  [pnG(s,yn,v o)  -ynlk(s,yn,uo)]  ds  +  -T^-1^) 
vr  n 

for  every  n  >  no- 


2.  Convergence  of  sequences  ( vn)n ,  (y„)n  and  (yn)„. 

We  remark  that 

KIvLtfft)  +  =  1- 

The  sequences  (z/„)ra  and  (y„)„  are  respectively  bounded  in  E  and  in  A4(fi),  there 
then  exist  P  G  R+,  p  G  M(£l)  and  a  subsequence,  still  denoted  by  {p„,pn)n,  such 
that 

(4.17)  vn  — >  P,  pn  — *•  p  weakly  star  in  M (£2). 

Let  1  <  r  <  N/(N  —  1)  for  which  the  following  embeddings  are  continuous 

w1-T(«)  Lq\n),  wx-?’r(r)  Lf'(r). 

Prom  Theorem  3.2,  it  yields 

||Pn||ivi.T(fi)  <  C5{pn\\Fy(-,yn)\\Li{n)  +  Pn\\G'y(-,y„,vn)\\Li(r)  +  \pn\M(Q)\fy(^yn)\c(n)}- 
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Since  the  sequences  (vn)n,  (p n)n >  ( yn)n  and  (yn)n  are  bounded  respectively  in  R, 
M(Q),  C(Cl)  and  in  Lf(T),  the  sequence  (pn)n  is  bounded  in  Wl'T(Vt).  There  then 
exist  p  6  W’1,T(n)  and  a  subsequence,  still  denoted  by  (p„)„,  such  that  (pn)„  weakly 
converges  to  p  in  Wl’T(Q).  Therefore,  (pn)„  weakly  converges  to  p  in  (Cl)  and  the 
sequence  of  traces  (p„|r)«  weakly  converges  to  the  trace  p|r  in  Lr,(T).  Let  us  prove 
that  p  is  a  weak  solution  of  equation  (2.4). 

Let  ip  be  in  W1,00(fi),  for  every  n  >  n0,  we  have  : 

,  N  f 

(4.18)  JE  ciijDiPnDjp  +  &y(x,  yn)pn<p}  dx  +  ^y(s,  yn,  Vn)pnip  ds 

Jn  i,j= 1  ■/r 

=  J ^  vnFy(x,  yn)p  dx  +  Jr  vnG'y{s ,  yn,  vn)<pds  +  (pn,  /'(•,  yn)<p)M(n)xC(n), 

and  the  Green  formula  (4.15).  Moreover,  since  fr  \vn  —  v\r  ds  <  (dE(vn,v))— M*  < 
(L)1^,  (vn)n  converges  to  v  in  Z/(r)  and  (yn)n  converges  to  y  in  C(f2).  Thanks  to 
assumptions  on  4>,  T,  F,  G,  f,  we  have 

-  $y(->2/)lknn)  =  °>  II Fli-,Vn)  -  Fy(-,y) lUqn)  =  °> 
lim||^(-,t/n,un)  -  ^(-,  j(,C)|Uf(r)  =  0,  lim|| Gy(-,yn,vn)  -  G'y(-,y,v) ||ii(r)  =  0, 
!im  !/,)(•, 2/n)  -  fy(',y)\c(n)  =0. 

Thus,  by  passing  to  the  limit  in  (4.18)  and  in  (4.15),  we  prove  that  p  is  the  unique 
weak  solution  of  the  equation  (2.4)  which  satisfies  the  Green  formula  : 

(4.19)  p(Ay  +  &y(x ,  y)y)  dx  +  ^,p(^  +  V>  %) ds 

=  f  v F'y(x,y)ydx  +  f  PGy(s,y,v)y ds  +  </x, /'(-, y]y)M(.n)xC(n), 
i/0  J  r 

for  every  y  G  {y  G  Hl( Q)  |  Ay  e  Lq (CL),  ^  G  Lr(P)}.  Because  of  uniqueness  of  the 
weak  solution  of  (2.4)  satisfying  (4.19),  we  can  deduce  by  classical  arguments  that  p 
is  independent  of  r  and  that  all  the  sequence  (p„)n  weakly  converges  to  p  in  W1,T(n) 
for  every  1  <  r  <  N/ ( N  —  1). 

3.  Pontryagin’s  principle. 

Notice  that  (vn)n  tends  to  v  in  Lf(r).  By  letting  n  tend  to  infinity  in  (4.16),  with 
Fatou’s  Lemma  (applied  to  the  sequence  of  functions  unG( •,  0,  vn(-)))  and  the  conver¬ 
gence  results  stated  in  step  2,  we  obtain 

(4.20)  J^H(s,y(s),v(s),p(s),P)ds  <  J^H(s,y(s),v0(s),p(s),v)  ds, 
for  every  vq  G  V. 

On  the  other  hand,  from  the  definition  of  pn  and  from  (4.2),  we  deduce 

(4.21)  (pn,z-  f{-,yn))M{n)xcm  <  0  for  every  z  G  {z  G  C(Cl)  |  z  <  0}. 

By  passing  to  the  limit  in  this  expression,  we  obtain 

(4.22)  (fi,  z  -  f(-,y))M<fl)xcm  <  0  for  every  z  G  {z  G  C(Cl)  \  z  <  0}, 
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which  is  equivalent  to  fi  >  0  and  {fi,  f  (■ ,  y))  M{n)xC{n)  —  0-  To  prove  that  (v,  fj)  is 
nonzero,  we  recall  that  v\  +  \/J-n\2M^)  =  ^  v  >  0,  the  proof  is  complete.  If  v  —  0, 

we  can  prove  that  >  0  by  using  lim„  \y.n\M(n)  =  1-  Indeed  there  exists  a  ball 

B(z ;  2e)  c  {z  £  C(Cl)  \  z  <  0}  centered  at  z  and  with  radius  2e  >  0.  We  can  choose 
zn  e  B( 0;2e)  such  that  {l^n,  zn)  M{n)xC{n)  =  e\Hn\M(n)-  Since  z  +  zn  €  {z  €  C(Q)  \ 
z  <  0},  from  (4.21),  we  get 

(/bn  z  +  zn  /(•,  yn))  —  0- 

By  passing  to  the  limit,  we  get 


thus  |U^0. 


e+  <  fl,  z  —  f{',y))M{Q)xC(Q)  <  0) 
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Designing  for  Optimal  Energy  Absorption  II, 
The  Damped  Wave  Equation 

STEVEN  J.  COX 

Department  of  Computational  and  Applied  Mathematics 
Rice  University 

Abstract.  We  consider  the  wave  equation  in  a  bounded  domain  with  zero  Dirichlet  data 
and  damping  proportional  to  velocity  and  pose  the  problem  of  minimizing,  with  respect 
to  damping,  the  maximum,  over  all  initial  data  of  unit  energy,  of  the  infinite  time  integral 
of  the  instantaneous  energy.  We  show  the  minimum  to  exist  over  those  dampings  that 
uniformly  avoid  zero  and  infinity.  We  provide  an  exact  minimum  over  the  class  of  constant 
dampings  and  proceed  to  show  it  to  be  a  critical  point  over  the  class  of  bounded  dampings. 

1991  Mathematics  Subject  Classification.  35B20,  35L05,  47A55 

Key  words  and  phrases.  Eigenvalue,  damping,  Lyapunov,  optimal  design,  perturbation. 


1.  Introduction 

In  [1]  we  minimized  two  standard  merit  functions,  decay  rate  and  greatest  total 
energy,  over  a  class  of  finite  dimensional  damped  linear  systems.  Analogous  results 
for  the  decay  rate  in  the  context  of  the  damped  wave  equation  can  be  found  in  [5], 
[4],  and  [2].  In  this  note  we  minimize,  over  a,  the  greatest  total  energy  associated 
with  the  damped  wave  equation, 

(1.1)  utt{x,t)  -  Au(x,t)  4-  2a(x)ut(x,t)  =  0,  u(-,t)  6  Hq(Q) 

on  the  open  bounded  connected  set  D  C  Rd.  The  greatest  total  energy  is  simply 
the  maximum,  over  all  initial  data  of  unit  energy,  of  the  infinite  time  integral  of 
the  instantaneous  energy.  In  order  to  make  these  notions  precise  let  us  take  U(t)  = 
[ u(t )  ut{t)\  and  interpret  (1.1)  as  Ut  =  A(a)U  where 

A(a)=(°  _72a),  C(i)  =  (iI2(!l)nfl’(ll))xff’(Sl), 

is  densely  defined  in  the  Hilbert  space  X  =  Hq(TI)  x  L2(Q)  with  inner  product 
{[f,g\,[uA)x  =  J^Xf  -Xu  +  gvdx. 

This  A(a)  is  the  infinitesimal  generator  of  a  semigroup  T(t;  a)  and  U(t)  =  T(t;  a)V 
solves  the  Cauchy  problem  Ut  =  A(a)U,  U(0)  =  V.  The  associated  instantaneous 
and  total  energies  are,  respectively 

r° c 

E{t)  =  \\T{t-a)Vfx  and  ^  \\T{t-a)Vfxdt. 


This  work  was  supported  by  NSF  Grant  DMS-9258312. 
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The  greatest  total  energy,  71(a),  is  now  the  supremum  of  the  total  energy  over  all 
initial  data,  V,  of  unit  energy.  That  is, 

poo 

(1.2)  T1(a)=  sup  /  \\T(t-,a)V\\%dt. 

||V||x=l  Jo 

In  §2  we  show  that  T\  (a)  is  finite  over  the  class  of  bounded  nonnegative  a  and  in  fact 
that  it  attains  its  minimum  there.  In  §3  we  compute  7) (a),  by  hand,  for  constant  a 
and  so  readily  identify  the  best  constant  damping.  In  §4  we  show  this  best  constant 
to  be  a  critical  point  of  7)  in  the  class  of  bounded  a. 

2.  Existence  of  an  Optimal  Design 

We  denote  by  {A,,}^  the  increasing  sequence  of  eigenvalues  of  -  A  on  Hq(Q)  and 
by  {gn}^Ti  the  corresponding  orthonormal  base  of  eigenfunctions. 

We  choose  6  small  and  positive  and  define 

ad={ae  L°°(Q)  :  Sy/xj 2  <  a(x)  <  J\i/{28)} 

Prom  the  literature  we  may  conclude,  for  such  a,  that  7)  (a)  is  finite  and  that  the 
total  energy  may  be  represented  as  a  quadratic  form.  More  precisely, 

Theorem  2.1.  If  a  G  ad  then 

(2.1)  l|T(t;a)|||w<4e-^,  t  >  0, 

and  there  exists  an  Hermitian  positive  semidefinite  endomorphism  B(a)  on  X  for 


which 

(2.2) 

2(B(a)A(a)V,V)x  = -\\V\\2X, 

VP  e  D{A), 

and 

(2.3) 

poo 

{B(a)V,V)x  =  Jo  \\T(t-,a)V\\*xdt, 

MV  e  X. 

Proof.  The  estimate  in  (2.1)  follows  directly  from  Theorem  1  of  Rauch  [8].  Datko  [6] 
has  shown  that  (2.1)  is  a  necessary  and  sufficient  condition  for  the  existence  of  the 
stated  B.  ■ 

As  ad  is  compact  with  respect  to  the  weak*  topology  we  need  only  show  a  >->  %_  (a) 
to  be  weak*  lower  semicontinuous.  Let  us  denote  weak*  convergence  by  A,  weak 
convergence  by  — *■,  and  strong  convergence  by  — >. 

Theorem  2.2.  If  {a„}  C  ad  and  an  A  a  in  L°°(Q)  then  T(t\  an)V  — >  T(t;  a)V  in  X 
for  each  V  G  X. 

Proof.  Recalling  Kato  [7,  IX.2.16]  we  note  that  it  suffices  to  show  that  A-1(an)R  — > 
A-1(a)P.  Set  V  =  [f,g]  and  define  [j/„,2„]  =  A~l(an)[f,g]  and  [y,z]  =  A~1(a)[f,  g) 
so 

[y„,Zn]  =  [A~1(2anf  +  g),f]  and  [y,  z]  =  [A"1(2a/  +  g),  /]. 

As  a  result, 

(2.4)  || A-^aJV  -  A-\a)V\\x  =  2||VA-1(an  -  a)/||2, 
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where  ||  •  H2  denotes  the  L2(Cl)  norm.  If  wn  =  A  x(a  —  an)f  then  wn  satisfies 
-Aw„  =  (an  -  a)f,  wn  £  Hq(CI). 


Upon  integrating  each  side  against  wn  we  find 


(2.5) 

IWl!  <  T-||Vrun||!  < 

Ai 

^WfUwnh- 

As  a  result 

\\wn\\2  <  6^-\\fh  and 

-| 

l|Vmn||2  <  -g\\fh 

From  these  bounds  it  follows  that  wn  — >■  w  in  Hq(Q).  Hence,  passing  to  the  limit  in 
the  weak  form 


we  find 


f  V wn  ■  V (/> dx  =  f  ( an  —  a)f<t>dx  V (j>  E  Hq(£1) 

J  n  in 

/  Vw  •  Vfidx  =  0  \r<i>eHUn), 

in 


i.e.,  in  is  identically  zero  in  fi.  As  wn  — >  0  in  £2(f2)  it  now  follows  from  (2.5)  that 
w„  — >  0  in  ffo(H).  Recalling  (2.4),  we  have  shown  that  A_1(a„)U  — >  A_1(a)U  in  X. 


Theorem  2.3.  If  an  a  in  then  71(a)  <  limmf71(an). 


Proof.  It  follows  from  the  previous  theorem  that  \\T(t;an)V\\x  — >  ||T(i;a)V||*  for 
each  t  >  0.  In  addition  it  follows  from  (2.1)  that  t  \\T(t;  an)V\\x  is  uniformly 
dominated  by  an  integrable  function.  The  Lebesgue  dominated  convergence  theorem 
now  yields 

Jr  co  roo 

1  \\T(t;an)Vfxdt^  \\T(t-a)V\\\dt, 

0  io 

or,  in  the  language  of  (2.2),  (B(an)V,V)x  — >  (B(a)V,V)x.  By  the  nature  of  the 
supremum  in  (1.2)  it  follows  that  to  each  e  >  0  there  corresponds  a  unit  vector  Ve 
for  which 

7j(a)  — e  <  (B(a)Ve,Ve)x- 


In  addition,  on  taking  the  limit  inferior  of  each  side  of  (B(an) Ve,Ve)x  <  71(an)  we 
find 


(B(a) Ve,Ve)x  <  liminf  7i(a„), 

'  n— >00 


and  hence 


71(a)  —  e  <  liminf  71(a„). 

x  7  n—y  00 


As  e  is  arbitrary  our  claim  has  been  established.  ■ 


Corollary  2.4.  a  1— >  71(a)  attains  its  minimum  on  ad. 

As  a  candidate  for  the  global  minimizer  we  now  carry  out  the  exact  minimization 
of  7)  over  constant  a. 
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3.  The  Case  of  Constant  Damping 

In  a  manner  analogous  to  the  case  of  friction  damping  of  finite  dimensional  systems, 
see  [1,  §2],  we  obtain  an  explicit  representation  of  B(a). 

Theorem  3.1.  If  a  is  constant  then 


Proof.  It  is  a  simple  matter  to  check  that  this  operator  is  an  Hermitian,  positive 
semidefinite  endomorphism  on  X  satisfying  (2.1).  There  are  a  number  of  means  by 
which  this  B(a)  may  be  derived.  One  way  is  to  note  that  for  smooth  solutions  the 
instantaneous  energy  obeys 

Eft)  =  +  Jjfuut  +  au2)dx^  =  —^(B(a)U(t),U(t))x, 

and  hence  that  the  total  energy  is  simply  ( B(a)V ,  V)x.  A  second  approach  is  to  note 
that  B(a)  is  (formally)  a  solution  to  the  associated  Liapunov  equation  A*(a)B(a )  + 
B(a)A(a)  =  —I.  ■ 

The  eigenvalues  of  B(a)  are 


1  a  ±  \/a2  +  Xn 

™  =  Ya  + - 2^ - ' 


n  =  1,2, 


and  its  (unit)  eigenvectors  are 


(3.2) 


k±n  —  Abfcn[l  £±n]  Qn 


where 

(3.3)  c±n  —  —  a  ±  \] a2  +  Xn  and  fi±n  =  (Xn  +  c\n) 

The  greatest  of  the  T±n{a)  is  Ti{a)  and,  as  Ai  is  simple,  so  too  is  Ti(a).  With  this 
explicit  expression  in  hand  we  may  easily  establish  (compare  [1,  Theorem  2.4]) 

Corollary  3.2.  The  greatest  total  energy,  7)  :  R+  — >  R+,  is  strictly  convex  and  attains 
its  qlohal  minimum  at 

.  rr  /V^T 

“-Ay— 


Its  minimum  value  is 


Ti  (a)  = 


1  v^+2  IV 5-1 

2  V  2 


We  record  for  future  use  the  fact  that 
(3.4)  c2  =  2a2  +  Ai 

follows  on  substitution  of  a  into  (3.3) 
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4.  The  Perturbed  Operator 

We  now  show  that  a  is  in  fact  a  critical  point  for  71  :  ad  — >  R+.  In  particular,  we 
shall  show  that,  for  each  b  €  L°°(Q,), 

71  (a  +  Kb)  =  71(a)  +  0(k2),  as  k  — >  0. 

In  fact  we  compute  the  gradient,  971,  at  an  arbitrary  admissible  constant,  a.  To 
begin,  we  fix  b  G  L°°{ fi)  and  consider  A(k)  =  A0  +  kA\  where 

and  Xbj- 

We  denote  by  T(t;  k)  the  semigroup  generated  by  A(k)  and  recall,  see,  e.g.,  Kato  [7, 
IX.2.1],  that,  for  fixed  f,  kh  T(i;  k)  is  entire  and 

OO 

T(t;K)  =  J2^Tn(t) 

71=0 

where  T0(t)  is  the  semigroup  generated  by  A0  and 

T„+i(t)  =  f  To(t  —  s)A\Tn(s)  ds,  n  >  1. 

70 

If  k  is  sufficiently  small  then  a  +  Kb  will  lie  in  ad  and  so,  by  Theorem  2.1,  there  exists 
a  B(a  +  Kb).  We  now  express  this  as  a  power  series  in  k  by  following  the  construction 
in  Datko  [6].  In  particular,  we  define, 

ft  °° 

j B(t-,n)=  /  T*(s-,K,)T(s-,K)ds-J2KnBn(t), 

n—0 

where 

B0(t)  =  f  Tq(s)T0(s)  ds  and  Bn{t)  =  f{T^s)Tn(s)  +  Tn(s)T0(s)}  ds. 

Jo  Jo 

As  t  — >  oo  each  Bn(t)  converges  in  the  strong  operator  topology  to  an  operator  Bn. 
It  follows  that 

(X) 

B(a  +  Kb)  =  ^ 

71=0 

is,  in  the  language  of  Kato  [7,  §VII.7],  a  selfadjoint  bounded-holomorphic  family  of 
operators.  As  such  we  may  avail  ourselves  of  the  perturbation  series  of  [7,  §11.2.2]. 
In  particular,  as  71(a)  is  simple,  the  greatest  eigenvalue,  71  (a  +  Kb),  of  B(a  +  Kb) 
satisfies 

OO 

71  (a  +  Kb)  =  71(a)  +  £  kTT^, 

71=1 

where  7j  may  be  expressed  in  terms  of  the  eigenvectors  of  B{a).  More  precisely, 

=  (B1V1,V1)X  and  =  (B2VU  Vjx  +  E 
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where  Tn  and  Vn  are  the  eigenvalues  and  eigenvectors  of  B(a),  see  (3.1)-(3.3).  Ac¬ 
cordingly  we  begin  the  evaluation  of 


roo 

(BtVuVjx  =  l  ({TSWi(t)  +  Ti  (t)T0(t)}Vi,Vi)x  dt 

poo 

=  2/  (T&yV^ToWVdxdt. 

JO 

It  will  be  convenient  to  suppose  that  a 2  <  Ai  and  to  label  the  frequencies 


U) 


’j  —  yfid 


As  V\  is  a  constant  multiple  of  qi  and  Aq  is  a  constant  coefficient  operator  it  follows 
that  T0(t)Vi  is  simply 


To(t)Vi  —fiie 


cos 


(coit)  +  ( ci  +  a)  sin(ujit)/uji  \ 


^ci  cos(duit)  —  (aci  +  Ai)  sin(wit)/wi  j  qi 

For  ease  of  reference  let  us  express  this  as 


r„«)v,  =  feg),9l. 


Regarding  Ti(t)V\  we  must  first  compute  T0(t  — 1s)A1T’0(s)Vi.  This  is  the  solution,  at 
time  t  -  s,  to  W'  =  A0W  subject  to 


W(0)  =  AiT0(s)Vi  =  -2 bqiy[{s)  L 


We  find,  via  separation  of  variables,  that 


W(t  -  s)  =  E  _  S))/Ui)  «>*■ 


Hence, 

Ti(t)Vi  =  J*  W(t  -  s)  ds  =  -2  £  (pfe))  qJbqi’  «*)» 

where 

ctj(t)  =  J  e_a(t_sVi(s)sin(wj'(f  _  s))/a;t  ds. 

It  follows  that 

{TMVuToiQVOx  =  -2<&<7i,<zi>26(*) 


where 

6  (t)  =  Aic*i(t)yi(<)  +a'1{t)y'1(t). 


It  is  important  to  note  that  is  simply  a  product  of  sums  of  terms  involving  sine, 
cosine,  and  the  exponential  and  so  may  be  integrated  by  hand.  Its  length  (greater 
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than  one  page)  however  precludes  us  from  effectively  presenting  it  here  and  compels 
us  to  adopt  a  symbolic  means  (Maple)  in  the  final  evaluation  of 


Jroo  roo 

1  (T1(t)V1,T0(t)V1)xdt  =  -4(bquq1)2  6( t)dt 

o  Jo 


(bqi,qi) 


2  a2  —  c\  —  Ai 
2a2(Ai  +  c\)  ' 


If  b  =  1  this  indeed  agrees  with  what  one  finds  on  differentiating  the  7)  offered  in 
(3.1).  In  addition,  from  (3.4)  it  follow  that  7^  =  0  when  evaluated  at  the  best 
constant,  a. 

As  to  whether  or  not  a  is  a  local  minimizer  we  note  that  components  for  the 
calculation  of  7 are  all  here.  The  complexity  of  the  terms  however  has  rendered 
this  calculation  a  formidable  exercise. 

Even  with  T}2'1  in  hand  one  has  only  a  local  result.  A  numerical  study,  analogous 
to  [2],  is  currently  underway. 

Finally,  we  note  that  V  ( B(a)V ,  V)x  is  a  Liapunov  function,  see  [3],  with  which 
one  may  study  the  stability  of  the  trivial  solution  of  (1.1)  with  a  right  hand  side 
depending  on  u,  ut,  and  Vu. 
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at  infinity.  We  also  give  a  counterexample  showing  that  this  property  may  fail  when  the 
nonlinearity  is  of  superlinear  type. 
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1.  Introduction 

Let  Q  be  a  bounded  open  subset  of  of  class  C2m,  T  >  0,  u  a  nonempty  open 
subset  of  f2,  /  a  continuous  real  function  and  k  6  N  such  that  0  <  2 k  <  m.  The 

main  goal  of  this  work  is  the  study  of  the  approximate  controllability  of  the  following 

semilinear  equation  with  Dirichlet  boundary  conditions: 

f  yt  +  (-A)my  +  (-A)kf(y)  =  h  +  vXu  in  Q  :=  fi  x  (0,T), 

(1-1)  <  |^  =  o  ,  j  =  0,l,...,m—l  on  E  :=  <9D  x  (0,T), 

[  2/(0)  =  Vo  in 

where  v  is  a  suitable  output  control,  is  the  characteristic  function  of  u>,  v  is  the 
unit  outward  normal  vector,  h  e  L2(0,T  :  H~m( Q))  and  y0  £  L2(Q).  Due  to  the  term 
Xw  the  controls  are  assumed  supported  on  the  set  O  :=  u>  x  (0,  T).  Problems  as  (1.1), 
sometimes  known  as  Cahn-Hilliard  problems,  appear,  with  m  =  2,  in  the  study  of 
phase  separation  in  cooling  binary  solutions  and  in  other  contexts  generating  spatial 
pattern  formation  (see  [6],  [8]  and  the  references  cited  therein). 

We  recall  that  problem  (1.1)  satisfies  the  approximate  controllability  property,  at 
time  T  with  states  space  X  and  controls  space  Y,  if  the  set 

{  y(T,  ■  :  v)  :  v  G  T,  y  solution  of  (1.1)} 

is  dense  in  X. 

The  main  goal  of  this  paper  is  to  extend  the  approximate  controllability  results  on 
second  order  problems,  m  =  1  and  k  =  0  (see  e.g.  [9],  [10]  and  [7])  to  the  case  of 
higher  order  equations  for  which  the  maximum  principle  does  not  hold,  in  general. 
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Our  first  result  gives  a  positive  answer  when  /  is  assumed  to  be  sublinear  at  the 
infinity: 

Theorem  1.1.  Assume  that  f  satisfies  the  following  conditions:  there  exist  some  pos¬ 
itive  constants  C\  and  C2  such  that 

(1.2)  |/(s)|  <  ci  +  c2|s|  for  all  s  eR 
and 

(1.3)  there  exists  f{s0)  for  some  s0  G  M. 

Then  problem  (1.1)  satisfies  the  approximate  controllability  property  at  time  T  with 
states  space  X  =  Z,2(0)  and  controls  space  Y  =  L2(0). 

In  contrast  to  the  above  result,  we  shall  prove  that  when  /  is  superlinear  the 
approximate  controllability  property  does  not  hold  in  general,  as  explained  in  Section 
4.  Therefore  if,  for  instance,  f(s)  =  |s|p_1s  Theorem  1.1  gives  a  positive  approximate 
controllability  result  for  0  <  p  <  1.  The  results  of  section  6  provide  a  negative 
approximate  controllability  answer  when  1  <  p  <  00.  The  similar  alternative  was 
obtained  in  Dfaz-Ramos  [7]  for  second  order  parabolic  semilinear  problems. 

We  remark  that  the  existence  of  solutions  in  the  class 

y  G  L2(0,T;tf0m(fi))  n  C([0,T];L2(F!)),  f{y)  G  L2(Q),  A kf(y)  G  L2(0,  T; 

is  also  obtained  as  a  by-product  of  Theorem  1.1  for  a  suitable  subclass  of  controls. 
The  uniqueness  of  solutions  can  be  easily  proved  if,  for  instance,  /  is  nondecreasing 
or  Lipschitz  continuous.  Those  uniqueness  results  are  not  needed  in  our  arguments. 


2.  Approximate  controllability  for  an  associated  linear  problem 

In  order  to  prove  Theorem  1.1  we  follows  the  same  scheme  of  proof  than  in  [9],  [10] 
and  [7] .  We  define  the  function 


5(5) 


f(s)  -  f{so) 

s  -  s0 


From  assumptions  (1.2)  and  (1.3)  we  have  that  g  G  fl  C(M).  The  conclusion 

will  be  derived  from  a  fixed  point  argument.  As  /(s)  =  f{so)+g(s)s  —  g(s)so,  we  shall 
start  by  considering  the  approximate  controllability  for  a  linear  problem  obtained  by 
replacing  the  term  f(y)  by 


9(z)y  +  f(s0)  -  g(z)s0, 


where  z  is  an  arbitrary  function  in  L2(Q).  Notice  that  when  z  —  y  this  expression 
coincides  with  f(y)  and  that  if  we  denote  g(z(t,x ))  :=  a(t,x)  and 

h(a)  :=  — (— A)fc/(s0)  +  (-A )k(a(t,  x)s0), 


(2.1) 
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then  a  £  L°°(Q)  and  h(a)  £  L°°(0,T;  H~2k(Cl)).  More  in  general,  given  a  £  L°°(Q ) 
and  h(a)  defined  by  (2.1),  we  consider  the  approximate  controllability  property  cor¬ 
responding  to  the  linear  problem 

(  yt  +  (-A)my+(-A)k(a(t,x)y)  =  h+h(a)+uxul  in  Q  :=  Clx(0,T), 

(2-2)  0=0  ,  j  =  0, 1, . . .  ,m  —  1  on  E  :=  dClx(0,T), 

l  2/(0)  =  2/o  in  Cl. 

Before  stating  an  approximate  controllability  result  for  this  problem,  following  Lions 
[14]  and  Fabre-Puel-Zuazua  [9],  [10],  we  consider  e  >  0  and  yd  £  L2{ Cl)  and  we 
introduce  the  functional  J  =  J(-',a,yd)  :  L2(Cl)  — ►  R  defined  by 

(2.3)  J(tp0;a,yd)  =  J(ip°)  =  ^  \<p(t,  x)\dxdtj  +e\\  <p°  ||£2(n)  -  J^ydip°,dx 

where  ip(t,  x)  is  the  solution  of  the  backward  problem 


(2.4) 


—ft  +  (—A )mf  +  a(t , x)Ak<p  =  0  in  Q  :=  x  (0, T), 

0=0  ,  j  =  0,l,...,m-l  on  E  :=  dCl  x  (0,T), 
ip(T)  =  in  Cl. 


To  study  the  above  backward  problem  we  introduce  the  space 


W  :=  {y  £  L2(0,  T;  H™(Cl)),  yt  £  L2(0,T-,H~m(n))}. 


The  following  result  will  be  used  later 


Proposition  2.1.  Given  h  £  L2(0,T;H  m(0))  and  y0  £  L2(Ci),  there  exists  a  unique 
function  y  £W  satisfying 

(  yt  +  (-A)my  +  a(t,  x)Aky  =  h  in  Q, 

(2.5)  J  0=0  ,  j  =  0,  l,...,m-  1  on  E, 

{  2/(0)  =  2/o  in  Cl. 

Furthermore,  we  have  the  estimate 

(2-6)  ||2/||i2(oiT;/r™(fj))  +  ll2/i|[L2(o,7yr-™(n))  <  G  (||/i||L2(0|r;ff-m(n))  +  ll2/olU2(n))  > 

where  the  constant  C  depends  only  on  M  :=[[  a  ||l°°(<3)  (provided  that  Cl,  T  and  m 
are  kept  fixed).  Moreover,  if  h  £  L2(Q),  the  solution  y  also  satisfies  that 

(2.7)  y  £  L2(8 ,  T-  H2m{Cl))  and  yt  £  L2((8,  T)  x  Cl)  for  all  8  £  (0,  T). 

Proof.  For  all  n  £  N  we  define  yn+1  as  the  solution  of  the  following  iterative  problem 

2/"+1  +  (— A)myn+1  =  h  —  a(t,  x)Akyn  in  Q, 
djyn+1 

■  __  =  o  ,  j  =  0,l,...,m—  1  onS, 
yn+!(0)  =  y0  in  Cl, 


114 


Approximate  Controllability  for  Nonlinear  Parabolic  Equations 


where  y°(t)  :=  0  for  all  t  E  [0,  T\.  The  existence  of  a  solution  yn  E  W  can  be  found, 
for  instance,  in  Theorem  3.4.1  of  Lions-Magenes  [15].  Thus,  for  all  n  E  N\{0, 1}, 
yn+ 1  —  yn  satisfies 


{  ( yn+1  -  yn)t  +  (— A)m(yn+1  -  yn)  =  -a{t,  x)Ak(yn  -  y"”1) 

(2-8)  ,  ;  =  0.1 . m-1 


dpi 

ivn+l  -  ynm  =  o 


in  Q, 

on  S, 
in  fl 


and  therefore 


yn+1  -  yn  E  H1Jtm(Q)  :=  H\ 0,  T;  L2{Q))  n  i2(0,  T;  H2m{Sl)) 


and 

II  yn+1  -  yn  II h^(Q)<  ci  ||  aAk{yn  -  yn~l)  || l2(Q) 

(see,  for  instance,  Theorem  4.6.1  of  Lions-Magenes  [16]).  Then,  since 

H1,2m(Q)  C  C([0,T];i/m(f2)) 

with  continuous  embedding  (see,  for  instance,  Theorems  1.3.1  and  1.9.6  of  Lions- 
Magenes  [15]),  there  exists  c2  =  c2(T)  such  that 

||  yn+1  -  yn  ||c([o,T];ff™(n))<  c2  ||  aAk{yn  -  yn  *)  \\l*(Q)  ■ 

Further,  it  is  clear  that  we  can  choose  (72  =  C2(T)  such  that  for  all  t  E  [0,T] 

II  yn+l  -  yn  ||c([o,t];Hj"(Q))<  C2  ||  aAk{yn  -  yn  x)  ||L2((o,i)xfi)  • 

Hence, 

II  (yn+1  -  yn){t)  ||hj.(h)<  (C2M)2  J*  II  A k{yn  -  yn_1)(r)  ||ia(n)  dr,  for  all  t  E  [0,  T] 

and  therefore,  by  using  the  Poincare  inequality,  there  exists  a  constant  K,  indepen¬ 
dent  of  M,  such  that 

II  (* in+1-yn)(t)  ||3^(n)<  (KM)2  J0  II  (yn~yn~1)(T)  II  w) dr-  for  a11 1 6  M- 

Then,  for  every  t  E  [0,  T]  we  deduce  that 

II  (yn+1  -  yn){t )  ||ffy>(n)<  • lo  "  1 11  (y2  “  2/1)(Tn)  dTn  ■  ■  ' dTl 

<  ( K2M2)n  1 JQ  JQ  ■  ■  -Jo  II  y2  —  y 1  llc([o,T];Ky*(n))  drn  ■  ■  -  dr i 

<  {K2M2)n~l-^—] jjy  ||  y2  -  y1  ||c([o,T];//-(a)) 

-  (n  _  1)!  II  y  ~y  llc([0,T];H-(n))> 

which  implies  that 

||  yn+1  -  yn  ||c«o,Ti;fl5*(n))->  0  asn-MK 
and  therefore,  by  (2.8),  we  deduce  that 

||  (yn+1  ~  Vn)t  ||i,2(o,T;tf-,n(n))— >  0  as  n  — >  oo. 
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Then,  there  exists  y  G  W  such  that 

yn  — >  y  in  W  as  n  — >  oo. 

In  order  to  prove  that  y  satisfies  (2.5)  we  point  out  that 

Amyn->Amy  in  L2(0,T;  H~m(LT))  as  n  -»  cxj, 

Afcy”  — >  Afct/  in  L2(0)  as  n  — >  oo, 

and 

y?->yt  in  L2(0,  T;  H~m(Q,))  as  n  ^  oo. 

this  implies  (passing  to  the  limit)  that  y  is  the  solution  of  (2.5).  In  order  to  prove 
(2.6),  we  “multiply”  in  (2.5)  by  y.  Then  it  is  easy  to  see  that 

(2-9)  1 1 2/  i  |  z,2  (o.r;  jyy*  (n))  +  ||yt||L2(0,T;H-m(n)) 

<  c  (||/i||L2(o,T;//-”*(n))  +  ||yolU2(fi)  +  \\y\W(Q))  ■ 

Furthermore, 

II  y(f)  lli2(n)<  (||  y(0)  ||1,2(0)  +c2  ||  h  111,2(0, T;H-"*(n)))  +  °3  J0  II  ^(s)  lli2(n)  ^ s ■ 

Then,  applying  Gronwall’s  inequality  (see,  for  instance,  Lemma  4  of  Haraux  [11]), 
we  deduce  that 

II  v{t)  ||l,2(o)<  (ll  y(0)  Ili2(n)  +c2  ||  h  ||i2(o,r;//-m(n)))  e°st  V  t  6  [0,  T7]. 

From  here,  we  obtain  that 

II  y  ||l2(q)<  c4  (||h||L2(o,T-,ii'-'>i(n))  +  l|yolU2(fi)) 

which  implies,  together  with  (2.9),  inequality  (2.6).  Now,  thanks  to  (2.6)  and  the 
linearity  of  Problem  (2.5),  we  deduce  the  uniqueness  of  solution. 

Finally,  if  h  G  L2(Q),  since  y(6)  G  H^(Q)  for  all  6  G  (0 ,T),  taking  y{8)  as  initial 
datum  and  applying  Theorem  4.6.1  of  [16],  we  get  (2.7).  ■ 

As  usual  in  Controllability  Theory  we  shall  use  a  unique  continuation  property  for 
solutions  of  the  dual  problem  (in  our  case  Problem  (2.4)). 

Lemma  2.1.  Let  u>  be  a  nonempty  open  subset  of  Ll.  Assume  that 

ipeL2(0,T-H^(Ll))  n  C([0,T];L2(ft)) 

satisfies  (2A)  and  that  tp  =  0  in  O  =  u>  x  (0,  T).  Then  <p  =  0  in  Q. 

Proof.  From  Proposition  2.1  (applied  with  backward  time)  we  deduce  that  ip  G 
L2(0,T  -  8 ;  H2m(tl))  for  all  8  G  (0 ,T).  Then  Lemma  2.1  follows  from  Theorem  3.2 
of  Saut-Scheurer  [17].  ■ 

The  following  two  results  are  easy  adaptations  (by  using  Lemma  2.1)  of  the  similar 
ones  given  in  [9],  [10]  for  second  order  parabolic  problems. 
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Proposition  2.2.  The  functional  J{-\a,yf)  is  continuous,  strictly  convex  on  L2(fl) 
and  verifies 


(2.10) 


lim  inf 

>oo 


J(ip0;a,yd) 
II  F°  IU2(n) 


>  £. 


Further  J(-]a,yd)  attains  its  minimum  at  a  unique  point  <p°  in  L2(Q)  and 


(2.11) 


(p°  =  0  ||  yd  ||i2(n)<  £■ 


Proposition  2.3.  Let  M  he  the  mapping 

M  :  L°°(Q)  x  L2(n)  -»  L2{Ll) 

(a(t,x),yd)  — >  <p°. 

If  B  is  a  bounded  subset  of  L°°(Q)  and  K  is  a  compact  subset  of  L2(fl),  then  M(Bx 
K )  is  a  bounded  subset  of  L2(Lt). 


In  order  to  characterize  the  duality  of  problem  (2.4),  we  recall  that  given  a  convex 
and  proper  function  V  :  X  — ■>  K  U  {+oc}  on  the  Banach  space  X,  it  is  said  that  a 
element  p0  of  V'  belongs  to  the  set  8V(x0)  (subdifferential  of  V  at  xo  G  X)  if 

V (x0)  ~V(x)  <  (p0,x0-x)  V  x  G  X. 

It  is  well  known  that  that  if  V  is  Gateaux  differentiable  its  differential  coincides  with 
its  subdifferential  and  that  xq  minimizes  V  over  X  (or  over  a  convex  subset  of  X) 
if  and  only  if  0  G  dV(xo).  Finally,  if  V  is  a  lower  semicontinuous  function,  then 
Po  G  dV(x0)  if  and  only  if 

(?„,*)<  lim  n*.+M-n*.)(<+00)  Vxex 

h-*0+  h 

(See,  for  instance,  Aubin-Ekeland  [3]).  Coming  back  to  the  functional  J  we  have: 

Lemma  2.2.  For  every  <p°  G  L2(Ll)  ( tp°  0),  if  go  is  the  solution  of  (2A)  satisfying 

<p(T)  =  ip°,  we  have  that 

dJ(ip°;a,yd)  =  {£  G  L2(ty,  3  v  G  sgn(y)xo  satisfying 


J  £(x)8°(x)dx  —  (^J  \ip(t,x)\dT?j  (^j  v{t,x)9(t,x)dFsj 

+£  /  ..  — 9°(x)dx  -  (  yd(x)9°(x)dx  V6>°  G  L2(G)}, 

•fn  II  F  IU2(n) 

where  9  is  the  solution  of  (2 A)  satisfying  9(T)  =  9°. 

Proof.  It  is  an  easy  modification  of  Proposition  2.4  of  [10]. 

Let  us  prove  the  approximate  controllability  property  for  an  special  version  of  the 
linear  problem  given  in  (2.2). 
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Theorem  2.1.  If  ||  yd  ||l2(Q)>  e  and  (p  is  the  solution  of  ( 2.4)  corresponding  to  (p{T)  = 
(p°,  with  </5°  minimum  of  J(-;  a,  yf).  Then  there  exists  v  £  sgn((p)xo  such  that  the 
solution  of 

(  yt  +  (-A )my  +  (-A k){a{t,x)y)  =||  (p  ||Li(0)  v\o  in  Q, 

(2-12)  I  ^  =0  (j  =  0...(m- 1))  on  E, 

(  2/(0)  =  0  in  ft, 


satisfies 


and  then  ||  y(T)  -  yd  ||L2(n)  = 


y(T)  =yd-e- 


IU2(n)  ’ 


e. 


Remark  2.1.  In  the  case  ||  ya  ||l2(q)^  if  we  use  the  null  control,  we  obtain  y  =  0 
and  therefore  ||  y(T)  -  yd  ||l2(q)<  e. 

First  of  all  we  prove  the  existence  and  uniqueness  to  problem  given  by  (2.2). 


Proposition  2.4.  Assumed  y0  £  L2(ft),  h  £  L2(0,T;H  m(ft))  and  a(t,x)  £  L°°(Q), 
there  exists  a  unique  function  y  £  W  satisfying 

(  yt  +  (-A)my  +  Ak(a(t,x)y)  =  h  inQ, 

(2-13)  j  |4  =  0  ,  )  =  0, 1, . . .  ,m  —  1  on  E, 

(  2/(0)  =  ?/o  in  ft. 

Moreover,  we  have  the  estimate 

(2.14)  \\y\\L2(o,T-,H™(n))+\\yt\\L2{o,T-,H-™(n)<C  (||h||L2(o,r;tf-m(n))  +  lbo|U2(n)J  , 
where  the  constant  C  depends  only  on  M  (provided  that  ft,  T  and  m  are  kept  fixed). 


Proof.  For  all  n  £  N  we  define  again  yn+1  as  the  solution  of  the  iterative  problem 

Vt+1  +  (— A)myn+1  =  h  —  A k(a(t,x)yn)  in  Q, 

Qiyn+ 1 

y  J  =  0, 1, . . .  ,m  —  1 


dif  ° 
yn+ 1(0)  =2/0 


on  E, 
in  ft, 


where  y°{t)  :=  0  for  all  t  £  [0,T].  The  existence  of  a  solution  yn  £  W  can  be  found, 
for  instance,  in  Theorem  3.4.1  of  Lions-Magenes  [15].  Thus,  for  all  n  £  N\{0, 1}, 
yn+l  _  yn  jg  solution  of 


(2.15) 


(yn+1-yn)t+(-A)m(yn+1-yn)  =  -Ak[a(t,x)(yn-yn  J)]  inQ, 
di(yn+1—yn)  _ 

^  Qvj - -  =  0  ,  j  =  0. 1, ...  ,m  —  1  on  E, 

( yn+1  —  yn){ 0)  =  0  in  ft 


and  therefore  (see  again  Theorem  3.4.1  of  Lions-Magenes  [15])  yn+1  —  yn  £  W  and 


(2.16) 


,  n+1 


yn  \\w<  ci  ||  a(yn  —  yn  J)  ||L2(q)  . 
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Then,  since  W  C  C([0,  T];  L2(f2))  with  continuous  embedding  (see,  for  instance,  [12] 
or  [15]),  we  have  that 

||  yn+1  ~yn  ||c([o,T];L2(n))—  c2  ||  a(yn  -  y n~1)  ||l2(q)  . 

Further,  as  in  the  proof  of  Proposition  2.1,  we  can  choose  C2  =  C2(T)  such  that 
||  yn+l  ~  Vn  ||c([(M];L2(n))<  C2  ||  a(yn  -  yn _1)  |U2((o,t)xn),  for  all  t  G  [0,  T\. 
Hence, 

||  {yn+1  -  yn)(t)  \\h(n)<  (C2M)2  £  II  ( yn  -  yn_1)(r)  111,2(0)  dT,  for  all  t  G  [0,  T] 


Then,  for  every  t  €  [0,  T]  we  deduce  that 


< 


(c2M2)n-1jo  jT. . .jTn  1 II  y2- y 1  \\c([o,n,vm  dTn--- dTl 


4-Tl — 1 


<  H  y2-?,1  || 

(ClM2T)n~l  „  ,  ^  I|2 


< 


(n-  !)! 


II  V  ~V  llc([o,r];L2(n))> 


which  implies  that 


||  yn+1  -  yn  ||c([o,r];L2(n))— *■  0  as  n-»oo 
and  therefore,  by  (2.16),  we  deduce  that 

||  yn+1  ~  yn  ||iv— >  0  as  n  — >  oo. 
Then,  there  exists  y  eW  such  that 


yn  — >  y  in  W  as  n  — >  oo. 

The  end  of  the  proof  is  similar  to  the  end  of  the  proof  of  Proposition  2.1.  ■ 

Proof  of  Theorem  2.1.  Using  the  subdifferentiability  of  J(.;a,yd)  at  ip0  (^  0  by 
(2.11)),  we  know  that 

0  G  dJ{(p°), 

which  is  equivalent,  from  Lemma  2.2,  to  the  existence  of  v  G  sgn((p)xo ,  such  that 
(2.17)  -||<p|Li(0)  (jov{x,t)e(x,t)dxdt)  =  j~— J^(p\x)e\x)dx 

-  [  yd(x)0°(x)dx. 

Jo 

On  the  other  hand,  as  y  G  W,  if  we  “multiply”  by  9  in  (2.12)  we  obtain,  by  (2.4), 
that 

J^y(T,x)6°(x)dxdt  =||  (p  ||Li(0)  v(x,t)9(x,t)dxdtj 


(2.18) 
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Then,  from  (2.17)  and  (2.18),  we  obtain 


Jn y(T , x)9°(x)dxdt  =  J(yd(x)  -  e-  )9°(x)dxdt  V  6°  G  L2(ft) 


and  we  conclude  that  y(T )  =  yd~  .. - .  ■ 

II  V  ||L2(0) 

Now  we  are  ready  to  prove  a  linear  version  of  Theorem  1.1  for  problem  (2.2) 

Corollary  2.1.  Let  ||  yd  |U2(n)>  £  and  Cp  the  solution  of  ( 2A)  corresponding  to  (p(T )  = 
(p°,  with  minimum  of  j{-\a,yd  —  y(T;a,0)),  where  in  general  y(t\ a,  u)  denotes  the 
solution  of  (2.2)  corresponding  to  the  control  u.  Then  there  exists  v  G  sgn(ip)xo 
such  that  the  solution  of 

(  yt  +  (-A )my  +  (-A k)(a(t,x)y)  =  h  +  h{a)+  ||  $  ||Li(0)  vxo  in  Q, 

|  0  =  °  (j  =  0  "  •  (m  -  i))  on  S, 

i  2/(0)  =  2/o  in 

satisfies 

||  y{T)  -  yd  ||i,2(n)<  £• 

Proof.  We  put  y  =  L  +  Y,  where  L  =  L{a)  satisfies 

'  Lt  +  (-A )mL  +  (-A k)(a(t,  x)L)  =  h  +  h(a)  in  Q, 

(219)  '  S=°  (3  =  °- ••(*"- !))  1  onS> 

.  L(0)  =  y0  in  ft 

and  Y  =  Y(a )  is  taken  associated  to  the  approximate  controllability  problem 

'  Yt  +  (-A)mF  +  (-A k)(a(t,x)Y)  =  u(a)xo  in  Q, 
djY 

<  _  =  0  (j  =  0 . . .  (m  -  1))  on  E, 

y(0)  =  0  in  ft, 

with  desired  state  yd  —  L(T),  i.e.  such  that  ||  Y(T)  —  (yd  —  L(T))  ||<  e.  Notice  that 
the  existence  of  such  a  control  u(a)  is  consequence  of  Theorem  2.1.  In  particular, 
if  ||  yd  -  L(T)  ||  <  £,  we  can  take  u(a)  =  0  and  if  ||  yd  -  L(T)  ||>  e,  then  we  take 
u(a)  =||  (p{a)  ||ii(n)  u(a),  where  v(a)  G  sgn(ip(a))xo  and  (p(a)  is  the  solution  of  (2.4) 
with  initial  value  M{  ( a(x,t),Vd  -  L{T))  )  defined  in  Proposition  2.3.  It  is  obvious 
that  such  function  y  and  such  control  u(a)  lead  to  the  conclusion.  ■ 


3.  Controllability  for  the  nonlinear  problem 

As  mentioned  before,  we  shall  use  a  fixed  point  argument  to  prove  Theorem  1.1. 
In  fact  we  shall  deal  with  multivalued  operators.  Let  us  recall  a  well-known  result: 
the  Kakutani’s  fixed  point  Theorem.  The  usual  continuity  assumption  in  other  fixed 
pont  theorems  is  replaced  here  by  the  following  notion: 
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Definition  3.1.  Let  X,  Y  two  Banach  spaces  and,  A  :  X  — >  V{Y)  a  multivalued 
function.  We  say  that  A  is  upper  hemicontinuous  at  x0  G  X,  if  for  every  p  €  Y' ,  the 
function 

x^o(A(x),p)  =  sup  <p,y>vxY 

yeA(x) 

is  upper  semicontinuous  at  x$.  We  say  that  the  multivalued  function  is  upper  hemi¬ 
continuous  on  a  subset  K  of  X,  if  it  satisfies  this  properties  for  every  point  of  K. 

Theorem  3.1.  (Kakutani’s  fixed  point  Theorem,).  Let  K  c  X  be  a  convex  and 
compact  subset  and  A  :  K  —>  K  an  upper  hemicontinuous  application  with  convex, 
closed  and  nonempty  values.  Then,  there  exists  a  fixed  point  xq,  of  A. 

For  a  proof  see,  for  instance,  Aubin  [2]. 


Proof  of  Theorem  1.1.  We  fix  yd  G  L2(fl)  and  e  >  0.  By  using  Corollary  2.1,  for 
each  2  e  L2(Q)  and  e  >  0  it  is  possible  to  find  two  functions  tp(z)  G  L1(Q)  and 
v(z)  G  sgn(ip(z))xo  such  that  the  solution  y  =  yz  of 

f  yt  +  {-A)my+(-A)k(g{z)y)  =  h  +  h(g(z))+uxo  in  Q, 

(3'1}  g=0,  j  =  0,l,...m-l  onS, 

1  3/(0)  =  2/o  in 

(where  u  =  u(z)  =  \tp(z)\Li[o)v(z))  satisfies 
(3.2)  \y(T)  -  3/d | z,2 (O)  <  £• 

Here  <p(z)  is  the  solution  of  (2.4)  with  initial  value  M{  {g(z),yd  -  L(z\T))  )  (see 
Proposition  2.3)  and  a(t,x )  =  g{z),  where  is  L{z\T)  the  solution  of  (2.19),  with 
a  =  g(z),  at  time  T  . 


Lemma  3.1.  The  set 

{yd  —  L(z\T),  ZGL2(0)}, 
is  relatively  compact  in  L2(fl). 


Proof  of  Lemma  3.1 .  Applying  Proposition  2.4  it  is  easy  to  see  that  the  set  of  solutions 
L(z)  of 


(3.3) 


Lt  +  (-A )mL  +  (-A )k{g(z)y)  =  h  +  h(g{z))  in  Q, 
c»L 

•  —  =  0,  j  =  0,l,...m- 1  onE, 

oW 

L( 0)  =  T/o  m  it, 


satisfy 

(3.4)  |[  L(z)  \\w<  K(  1+  ||  7/o  ||l2(o)  +  ||  h  ||r,2(o,T;fl--’»(Q)))  V  z  G  L?(Q) 

with  K  >  0  independent  of  z.  Recall  that  ||  g{z)  ||l°°((3)<  Af  with  M  independent  of 
2.  Now,  let  L(zn)  be  a  sequence  of  solutions  (3.3)  with  zn  G  L2(Q).  We  must  prove 
that  there  exists  a  subsequence  (that  we  rewrite  as  L(zn)),  such  that 


L{zn-,T )  -  L(zn+uT)  ||x,2(I2)->  0  as  n  -»  oo. 
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By  a  compactness  result  due  to  Aubin  [1],  we  know  that 

W  C  L2(0,T;  Hm~1(Q))  with  compact  embedding. 

Therefore,  by  (3.4),  we  can  suppose  that 

II  L(zn)  —  L(zn+ 1)  ||r,2(o,T;rfm-l(n))— *■  0  as  n  — >  oo. 

Further,  it  is  easy  to  prove  that  L(zn)  —  L(zn+ 1)  satisfies 

II  L(zn]  T)  —  L(zn+i,T)  ||z,2(fi) 

<  -  jf  (Dk  ( g(zn)L(zn )  -  g(zn+1)L(zn+1)) ,  Dk  ( L(zn )  -  L(zn+1))) 

(T 

+  J I  ( Dk  ( g(zn)s0  -  g(zn+i)s0) ,  Dk  ( L(zn )  -  L(zn+ 1))) 

Then,  by  (3.4),  since  k  <  m  —  1  (notice  that  k  =  0  if  m  =  1), 

II  L(zn\  T)  —  L(zn+i ;  T)  |||,2(n)<  K  ||  L(zn)  —  L(z„+i)  |||2(0iT;H™-i(n))-^  0  as  n  — >  oo 

and  the  proof  ends.  ■ 

Completion  of  Proof  of  Theorem  1.1.  From  Lemma  3.1,  we  obtain  that  yd  -  L(z ;  T) 
belongs  to  a  compact  set  for  all  z  £  L2(Q)  and  soy  by  using  Propositions  2.3  and  2.1, 
we  obtain  that 

(3.5)  (||  <p(z)  || £i(o)  v(z),  z  e  L2(Q)}  is  bounded  in  L°°{Q) 

Thus 

(3.6)  Kx  =  sup  ||  ip(z)  || £i(o) <  oo- 

zei2(Q) 

Obviously,  u  =  u(z)  satisfies 

(3.7)  ||  u  ||l2(Q)<  K2- 
Therefore,  if  we  define  the  operator 

A  :  L\Q)  -»  V(L2(Q)) 
by 

A(^)  =  {y  satisfies  (3.1),  (3.2)  for  some  u  satisfying  (3.7)  }, 

we  have  seen  that  for  each  z  £  L2(Q),  A(z)  ±  0.  In  order  to  apply  Kakutani’s  fixed 
point  theorem,  we  have  to  check  that  the  next  properties  hold: 

(i)  There  exists  a  compact  subset  U  of  L2(Q),  such  that  for  every  z  £  L2(Q), 
A (z)  C  U. 

(ii)  For  every  z  £  L?(Q),  A(z)  is  a  convex,  compact  and  nonempty  subset  of 

i HQ)-  ' 

(iii)  A  is  upper  hemicontinuous. 
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The  proof  of  these  properties  is  as  follows: 

(i)  From  Proposition  2.4  we  know  that,  there  exists  a  bounded  subset  U  of  W  such 
that  for  every  z  G  L2(Q),  A(z)  C  U.  Now,  to  see  that  we  can  choose  U  compact  we 
shall  prove  that  the  set 

y  =  {y  satisfying  (3.1)  for  some  z  G  L2(Q)  and  u  satisfying  (3.7)} 
is  a  relatively  compact  subset  of  L2(Q).  But  this  is  easy  to  prove  by  using  that 
(3.8)  W  C  L2(Q )  with  compact  embedding 

(see  Lions  [12]  or  Simon  [18]). 

(ii)  We  have  already  seen  that  for  every  z  G  L2(Q ),  A(z)  is  a  nonempty  subset  of 
L2(Q).  Further  K(z)  is  obviously  convex,  because  B(yd,  e )  and  {u  G  L2(Q)  :  satisfying 
(3.7)}  are  convex  sets.  Then,  we  have  to  see  that  A(z)  is  a  compact  subset  of  L2(Q). 
In  (i)  we  have  proved  that  A (z)  C  U  with  U  compact.  Let  (yn)n  be  a  sequence  of 
elements  of  A(,z)  which  converges  in  L2(Q)  to  y  G  U.  We  have  to  prove  that  y  G  A(z). 
We  know  that  there  exist  un  G  L2{Q)  satisfying  (3.7)  such  that 


(3.9) 


'  y”  +  (—A )myn  +  (-A )k(g(z)yn)  =h  +  h{g{z))  +  unxo  in  Q, 

— v—  =  0  ,  j  =  0,l,...,m—  1  onS, 

ovl  .  _ 

yn(0)  =  y0  m 

.  I Vn(T)  -  2/d 1 2  <  £• 


Now,  by  using  that  the  controls  un  are  uniformly  bounded,  we  deduce  that  un  — >  u 
in  the  weak  topology  of  L2(Q )  and  u  satisfies  (3.7)  (see  Proposition  III. 5  of  Brezis 
[5]).  Then,  using  (3.9)  and  Proposition  2.4  we  can  see  that  (yn)n  converges  to  y  in 
the  weak  topology  of  W  (and  so,  by  (3.8),  strongly  in  L2{Q)).  Therefore,  passing  to 
the  limit  in  (3.9)  we  obtain 


'  yt  +  (—A )my  +  (-A )k{g{z)y)  =  h  +  h(g(z))  +  uxo  in  Q, 

•  ^  =  0  ,  j  =  0,l,...,m-  1  on  E, 

2/(0)  =  2/o  m 


Further,  vn  =  y  —  yn  is  solution  of 

'  Uj"  +  (-A)mvn  +  (-A )k{g(z)vn)  =  {u-  un)xo  in  Q, 

(3.10)  ^V=0,  j  =  0,l,...,m-l  onE, 

ovJ  .  ^ 

vn(0)  =  0  m  il 

and  satisfies  vn  G  W  (see  Proposition  2.4).  Further,  if  we  “multiply”  in  (3.10)  by  vn 
and  integrate,  we  obtain  that 


vn(T)  |||2(o)<  k 


un)xovndxdt 


Thus  yn{T)  converges  to  y(T)  in  the  strong  topology  of  L2(Q)  and  ||  y(T)—ya  ||2<  £■ 
This  prove  that  y  G  A (z)  and  concludes  the  proof  of  (ii). 
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(iii)  We  must  prove  that  for  every  zq  G  L2(Q) 

limsup  a(A{zn),  k)  <  a(A(zo),k),  V  k  G  L2(Q). 

L2(Q) 

Zfi  *  ZQ 

We  have  seen  in  (ii)  that  A{z)  is  a  compact  set,  which  implies  that  for  every  n  G  N 
there  exists  yn  G  A (zn)  such  that 

a(A(zn),k )  =  [  k(x,t)yn(x,t)dxdt. 

Jq 

Now,  by  (i),  ( yn)n  C  U  (compact  set  of  L2{Q)).  Then,  there  exists  y  G  A2(Q)  such 
that  (after  extracting  a  subsequence)  yn  — >  y  in  L2(Q).  We  shall  prove  that  y  G  Afyo). 
We  know  that  there  exist  un  G  L2(Q)  satisfying  (3.7)  such  that 


(3.11) 


Vt  +  (-A  )myn  +  (-A  )k(g{zn)yn)  =  h  +  h(zn)  +  unxo 
djyn 

=  J=0,l,...,m  — 1 

yn{  o)  =  yo 

\yn(T)  -  2/dla  <  e. 


in  Q, 

on  S, 
in  Q, 


Then  there  exists  u  G  T2(Q)  satisfying  (3.7)  such  that  un  — ►  u  in  the  weak  topology 
of  T2(0).  On  the  other  hand,  by  using  the  smoothing  effect  of  the  parabolic  linear 
equation  (in  a  similar  way  to  the  proof  of  (ii))  and  that  g  G  L°°(R)  flC(R),  we  deduce 
that  y  satisfies  (3.1)  and  (3.2)  with  z  =  z0  for  some  u  G  L2(Q)  satisfying  (3.7),  which 
implies  that  y  G  Afyo).  Then,  for  every  k  G  L2(Q), 

a(A(zn),k)=  /  k(x,  t)yn(x,t)dxdt  — >  /  k(x,t)y(x,t)dxdt 
Jq  Jq 

<  sup  /  k(x:t)y(x,t)dxdt  =  a(A(zo),k), 

yeA(zo)  JQ 

which  proves  that  A  is  upper  hemicontinuous  and  conclude  the  proof  of  (iii). 

Finally,  if  we  restrict  A  to  K  =  conv(U )  (the  convex  envelope  of  U ),  which  is  a 
compact  set  of  L2(Q),  it  satisfies  the  assumptions  of  Kakutani’s  fixed  point  theorem. 
Then,  A  has  a  fixed  point  y  G  K.  Further,  by  construction,  there  exists  a  control 
u  G  L2{Q)  satisfying  (3.7)  such  that 


(3.12) 


Vt  +  (-A )my  +  (-A )k(f(y))  =  h  +  u\o  m  Q, 

djy 

—  =0,y  =  0,  l,...m— 1  onS, 

2/(0)  =  2/o  in  Q, 

.  \y(T)  -  yd\2  <  e. 


Therefore,  y  is  the  solution  that  we  were  looking  for. 


Remark  3.1.  Several  generalizations  seem  possible.  For  instance,  the  equation  of 
(1.1)  could  be  replaced  by  other  ones  with  a  more  general  nonlinearity 

Vt  +  (-A)m2/  +  5Z(-A)7i(y)  =  h  +  vXu, 

i= 0 
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or  a  more  general  lower  order  differential  operator 

yt  +  (-A)my  +  L(f(y))  =  h  +  vXu, , 

with  L  suitable  linear  partial  differential  operator  of  degree  lower  than  2m.  The  key 
point  in  those  generalizations  is  that  the  unique  continuation  result  of  Lemma  2.1,  for 
the  associated  dual  problem,  remains  true  thanks  to  Theorem  3.2  of  Saut-Scheurer 
[17]  and  the  rest  of  arguments  of  the  proof  of  Theorem  1.1  apply. 

4.  Non-controllability  for  superlinear  problems 

In  this  section  we  assume  k  =  0.  We  shall  prove  a  result  of  non-controllability  for 
a  superlinear  nonlinear  term  with  W  C  ft. 

Theorem  4.1.  Let  p  >  1  and  let  y{t\u )  =  y  £  L2(0,  T;  PI  C([0,T];L2(fl))  a 

function  satisfying 

(  yt  +  (-A )my  +  \y\p~ly  =  «x u>  in  Q . 

\  y(0)  =  2/0  in  fi, 

associated  to  any  “natural”  boundary  condition  and  with  control  u  £  L2(Q).  Then 
we  can  choose  yd  £  L2(Q)  and  e  >  0  such  that 

(4.1)  ||  y{T;  u )  -  2 Jd  ||i2(fi)>  £  f°r  anV  u  £  L2{Q). 

In  order  to  prove  Theorem  4.1  we  introduce,  previously,  some  auxiliar  functions. 
Given  R  >  0  we  define,  on  Mw,  the  functions 

&(*)  =  (i?2  -  \x\2)/R  if  \x\  <  R,  fR(x)  =0  if  |*|  >  R 

and 

(4.2)  dR(x)  =  R  -  |m|  if  |m|  <  R,  dR(x)  =  0  if  |x|  >  R. 

It  is  clear  that 

(4.3)  dR(x)  <  £r(x)  <  2dR{x) 
for  all  x  £  M.N . 

The  following  result  was  proved  in  Bernis  [4], 

Proposition  4.1.  Let  s  >2m  and  R>  0.  Then,  for  each  e  >  0  there  exist  a  constant 
C  depending  only  on  N,  m,  s  and  e  (thus  independent  of  R)  such  that  the  following 
inequality  holds  for  all  y  £  Hffc(RN): 

((-A)ra2 /, fty)WxW  >  (!  -  £)  £.R\Dmy\2dx  -  C  JRN  CR~2my2dx. 

Remark  4.1.  Since  s  >  2m,  fsR  £  W2m-°°(Mw).  Hence  £  C™(RN)  (see  e.g.  Corol¬ 
lary  IX.  13  of  [5];  and  fsRu  €  tfcm(Mw)  (see  e.g.  Note  IX. 4  of  [5]). 
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Corollary  4.1.  Let  s>2m  and  R>  0  such  that  TPr  C  ft.  Then,  for  each  £  >  0  there 
exist  a  constant  C  depending  only  on  N,  m ,  s  and  e  (thus  independent  of  R)  such 
that  the  following  inequality  holds  for  all  y  £ 

((-A)my,£,sRy)H-™{n)xH™(n)  >  (1  —  e)  [  £SR\Dmy\2dx  —  C  f(sR  2my2dx. 

Proof.  Let  y  £  Hm(Q )  such  that  y  =  y  in  0  (such  y  exists  by  standar  results:  see, 
e.g.,  Chapter  IX  of  Brezis  [5]).  Then,  by  Proposition  4.1,  the  inequality  holds  for  y, 
but  as  Br  C  we  obtain  the  result.  ■ 

Theorem  4.2.  Let  p  >  1,  r  =p  +  1,  y0  £  L2(fl)  and  u  £  Lr'(Q).  Then  any  solution 
y  £  Lr(Q)  D  L2(0,  T ;  Hm(£i))  of 


(4.4) 


{ 


Vt  +  (-A )my  +  \y\p  1y  =  u  in  V(Q), 

2/(0)  =  yo  on  0, 

with  any  “natural”  boundary  condition,  satisfies  the  local  estimate 


sup 
o  <t<rJBR 


[  y(x,t)2dx+  [  (\Dmy\2 +  \y\r)dxdt 

J  Br  JBRx(0,T) 


<K\  1+  [  \u\r'dxdt+  ( 

V  JbRix(p,T)'  JbRi  J 

if  BRl  C  0  and  0  <  R  <  R\ .  Moreover,  the  constant  K  depends  only  on  N,  m,  p, 
R,  Ri  and  T. 

Proof  of  Theorem  4.2.  We  take  Xr  =  Lr{Q)  n  L2(0,T;  H^(Q).  Then  the  equation 
of  (4.4)  is  satisfied  in  X'r  =  Lr'(Q)  +  L2(0, T;  Then,  if  s  >  2m,  we  can 

multiply  (4.4)  by  £Ry  with  the  duality  product  (•,  -)x(.xxr  and  we  obtain 

A  [  tRy{x>T)2dx  +  {(-A)m?/,^J/)L2(0,T;R-m(n))xL2(0,T;Rf*(O)) 

Z  J  Br 


+  (\y\p  1y>  thky)  l*-'  (Q)xL'-(Q) 


=  2  JBr  ^Ryo(x)2dx  +  (u>  ^y)W(Q)xir(Q)- 

Now,  from  Corollary  4.1  it  follows  that 

U  &y(x,T)2dx+[  eR{\Dmy\2  +  \yY)dxdt 
2  Jbr  Jbrx(o,t ) 


(4.5) 


<  C  [  £sRy0(x)2dx  +  C  [  £r  2my2dxdt  +  C  [  (fRuydxdt. 

J Br  J BrX{0,T)  JBrX(0,T) 


Jbrx(0,T ) 


Brx(0,T) 


By  (4.2)  and  (4.3)  we  can  replace  in  (4.5)  £r{x)  by  R-\x\  (modifying  the  constants). 
Further,  writing  s-2m  =  2s/r  +  (s(r-  2) /r)  -  2m,  we  can  apply  Holder’s  or  Young’s 
inequality  with  exponents  q  =  r/2  and  q1  =  r/r  —  2  and  we  obtain 


l 


( R  —  |rc|)'s  2my2dxdt 


Brx(0,T) 


<  £ 


[  (R  -  \x\)s\y\r dxdt  +  K(e,q)  [  (R  -  |x|) 

Jbrx(0,T)  J  Brx(0,T) 


—  IdP  7 dxdt 
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_ 

.  n  1  ,  2mr 

Kle.q)  =  — — r~rr  and  7  = - - 

v  ’  q'{qe)q  /q  r-  2 

Hence,  if  we  choose  s  >  7  -  1,  the  last  integral  is  finite  and  equal  to  CRS+N~7.  On 
the  other  hand,  we  can  apply  again  Young’s  inequality  and  we  have 


/  {R  —  \x\)suydxdt 

JBrx{0,T) 

<e  f  (R-\x\Y\y\rdxdt  +  k(e,r)  [  (R  -  \x\)s\u\r' dxdt. 

JBRx(0,T)  Jbrx(0,T) 

Thus,  by  changing  the  constants,  we  deduce  that 

(R-  M)s2/(z,  Tfdx  +  [  (R-  \x\n\Dmy\2  +  \y\r)dxdt 

2  Jbr  Jbrx(o,t) 

-  °  (/*,<*  -  +  +  /a„, o.r,(*  -  WW***)  ■ 

Finally,  by  replacing  R  by  Ri  and  by  taking  into  account  that  R\  -  \x\  >R1  —  R  and 
i?!  —  |jc|  <  Ri  if  |x|  <  R  we  deduce  the  result  with 

r,  \r(  Ri  Y  CR{+n^  \ 

K-™*\C{rPXr)’  (jRi  -«)•/•  ■ 


Proof  of  Theorem  4.1.  It  is  a  trivial  consequence  of  Theorem  4.2  since,  if  Ri  satisfies 
BRl  C  Q\u,  then 

II  y(u:>T)  |||2(n)<  K(l+  ||  yo  ||i2(n))  Vu  G  Lr  (Q). 

Therefore,  taking  yd  with  ||  yd  ||L2(n)  large  enough,  we  obtain  (4.1)  for  e  >  0  small 
enough.  ■ 

Acknowledgements.  We  thank  F.  Bernis  for  some  useful  conversations. 
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Control  Problems  for  Parabolic  Equations  with 
State  Constraints  and  Unbounded  Control  Sets 
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Department  of  Mathematics 
University  of  California 

Abstract.  Using  nonlinear  programming  theory  we  derive  a  version  of  Pontryagin’s  max¬ 
imum  principle  for  abstract  parabolic  equations  that  includes  state  constraints  and  allows 
unbounded  control  sets.  The  results  are  shown  to  apply  to  parabolic  distributed  parameter 
systems  and  to  the  Navier-Stokes  equations. 

1991  Mathematics  Subject  Classification.  93E20,  93E25 

Key  words  and  phrases.  Distributed  parameter  systems,  optimal  controls,  unbounded  con¬ 
trols,  state  constraints. 


1.  Introduction 

Consider  the  reaction— diffusion  distributed  parameter  system 

(1.1)  =  ^y(t,x)  +  ^(y(t,x),Vy(t,x))  +  u(t,x)  (t>  O.zefi) 

in  a  domain  0  C  Mm  with  boundary  F,  a  boundary  condition  ft  acting  on  F.  Control 
problems  for  (1.1)  may  include  constraints  on  the  state  y(t ,  x)  and  the  gradient,  either 
pointwise 

(1.2)  y(t,  x)eMs  C  R,  Vy(t,  *)  €  Mff  C  Rm  (0  <  t  <  t) 
or  of  integral  type.  Target  conditions  can  be  also  pointwise, 

(1.3)  y(t,  x)  e  Ys  C  R,  Vy(t,  x)  eYg  C  R™ 

or  of  integral  type,  the  control  interval  [0,  t  ]  fixed  or  variable.  Control  constraints 
may  include 

(1.4)  u(t,x)eU  Cl  (0<i<i) 

and  (when  U  is  unbounded)  summability  conditions  in  the  cylinder  (0,  t)  x  fl. 

Interest  in  optimal  control  problems  with  unbounded  controls  is  more  than  aca¬ 
demic.  When  the  control  set  is  unbounded,  Pontryagin’s  maximum  principle  not  only 
gives  optimal  controls  as  solutions  of  independent  maximization  problems  for  each 
time  t  but  includes  the  statement  that  the  maximum  is  finite  -  sometimes  a  very 
potent  pronouncement.  (For  ingenious  ways  of  putting  this  to  use  in  finite  dimen¬ 
sional  systems  see  [8]).  As  an  infinite  dimensional  possibility  of  obvious  interest  we 
mention  that  of  setting  up  solutions  of  the  Navier-Stokes  equations  as  solutions  of 
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minimization  problems  involving  purely  differential  equations  (i.e.  relegating  nonlo¬ 
cal  operators  to  the  cost  functional).  This  can  be  done  in  several  ways,  and  in  all 
these  problems  the  controls  are  naturally  unbounded. 

Except  for  particular  cases  (such  as  the  linear-quadratic  problem)  interest  in  Pon- 
tryagin’s  principle  for  unbounded  control  sets  seems  of  recent  date.  Raymond  and 
Zidani  treat  in  [9]  and  [10]  boundary  control  problems  for  parabolic  equations.  Un¬ 
bounded  control  sets  were  considered  in  [6]  for  distributed  parameter  systems  with 
smooth  nonlinearities  in  reflexive  spaces.  These  conditions  fit  systems  described  by 
nonlinear  wave  equations  but  would  force  unreasonable  assumptions  on  a  system 
like  (1.1)  (for  instance,  they  would  not  cover  such  nonlinearities  as  cj>(y)  —  —  y3  or 
(f> (y,  Vy)  =  ( y  ■  V)y.  We  indicate  in  this  paper  how  to  handle  abstract  parabolic  sys¬ 
tems,  with  applications  to  reaction-diffusion  equations  and  the  Navier-Stokes  equa¬ 
tions. 


2.  Abstract  differential  equations 

We  study  (1.1)  via  the  abstract  model 

(2.1)  y'(t)  =  Ay(t )  +  f(t,  y(t))  +  Bu(t),  y{ 0)  =  C, 

The  operator  A  in  (1.1)  generates  a  bounded  analytic  semigroup  S(t)  in  a  reflexive 
separable  Banach  space  E  (for  a  nonreflexive  setup  see  §5)  and  0  £  p{A).  The  control 
space  for  (2.1)  is  F  =  X*  (X  a  separable  Banach  space)  and  B  :  X*  — ►  E  is  a 
bounded  operator  with  B*  :  E*  — >  X. 

The  assumptions  on  A  allow  construction  of  the  fractional  powers  (— A)a;  (—A)a 
is  bounded  for  a  <  0.  For  any  a,  (~A)aS(t)  is  a  bounded  operator,  continuous  in 
(E,  E )  for  t  >  0  ((E,  F)  =  {linear  bounded  operators  from  a  Banach  space  E  into 
the  Banach  space  F  equipped  with  the  operator  norm})  and 

(2.2)  ||(-A)a5(t)||  <Catae~a  {t>  0,  0<o<l). 

If  a  >  0  we  set  Ea  =  D((-A)a)  equipped  with  the  norm  \\y\\Ea  =  ||(-A)“j/||. 
Invertibility  of  (~A)a  implies  that  Ea  is  a  Banach  space.  For  a  <  0,  Ea  is  the 
completion  of  E  with  respect  to  the  norm  ||  •  ||sQ.  Since  E  is  reflexive,  A*  is  the 
infinitesimal  generator  of  the  strongly  continuous  semigroup  S*(t)  =  S(t)*  and  we 
define  the  spaces  (E*)a  using  the  fractional  powers  (— A*)a  =  ((— A)a)*  in  the  same 
way  the  Ea  are  defined  from  the  fractional  powers  {—A)a.  We  have 

(2.3)  (~A)^Ea  =  Ea+1,  {-A*)~\E*)a  =  (J E*)a+1 

for  7  >  0  and  — oo  <  a  <  oo  [5]. 

We  say  that  the  function  /  :  [0,  T]  x  Ea  — >  E~p  (a,  p  >  0,  a  +  p  <  1)  satisfies 
Hypothesis  Da>p  if  the  Frechet  derivative  dyf(t,y)  £  (Ea,  E_p)  exists  and 

(i)  f(t,  y)  is  continuous  in  y  £  Ea  for  t  fixed  and  strongly  measurable  in  t  for  y 
fixed, 

(ii)  dyf(t,  y)C  is  continuous  in  y  £  Ea  for  t  and  (,  £  Ea  fixed  and  strongly  measur¬ 
able  in  t  for  y,  (  fixed, 
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(Hi)  For  every  c  >  0  there  exist  constants  K(c),  L(c)  such  that 

(2.4) 

\\f(t,y)\\E.p  <  K(c ),  \\dyf(t,y)\\(Ea,E.p)  <  L(c)  (0  <  t  <  T,  \\y\\Ea  <  c). 

Controls  u(-)  in  (2.1)  are  elements  of  Lp(0,  T\X*)  with  p  >  1/(1  —  a),  where 
LPf(0, T;X*)  is  the  space  of  all  X-weakly  measurable  X*-valued  functions  u(-), 
equipped  with  the  usual  IP  norm;  this  space  is  the  dual  of  Lq( 0,  T;  X)  (1/p+l/q  =  1) 
and  does  not  in  general  coincide  with  LP(0,T;X*)  (except  when  X*  is  separable). 
The  control  set  U  is  an  arbitrary  subset  of  X*,  and  the  admissible  control  space 
Cad(0,  T;  U)  consists  of  all  u(-)  G  L£,(0,T;X*)  such  that  u(t)  G  U  a.e. 

Solutions  of  (2.1)  are  y(t)  =  (-A)~ar](t),  where  rj (t)  solves  the  integral  equation 

V(t )  =  (-A)aS(t) C  +  f(-A)a+»S(t  -  T)(-A)~»f(T,  (-A)-arj(T))dT 

(2.5)  t  Jo 

+  [  (—A)aS(t  —  T)Bu(r)dT. 

Jo 

The  assumptions  guarantee  local  existence  (see  [5]  for  details).  Note  that  if  z  G  E* 
we  have  (z,  Bg(t))  =  ( B*z,g(t )),  so  that  Bu(-)  is  if* -weakly  measurable;  since  E  is 
separable,  Bu(-)  is  strongly  measurable. 

We  consider  optimal  control  problems  for  (2.1)  in  a  fixed  or  variable  interval  0  < 
t  <t.  State  and  target  conditions  are  given  by 

(2.6)  y(t)  G  M  (0  <  t  <  t ),  y(t,  u)  G  Y, 

with  the  state  constraint  set  M  C  Ea  and  the  target  set  Y  C  Ea  closed  in  Ea.  The 
cost  functional  is 

(2.7)  Vo(t,u)  =  f0(T,y(T),u(T))dT 

where  f0  :  [0,T]  x  Ea  x  U  —>  R  satisfies  Hypothesis  this  means  the  Frechet 
derivative  dyfo(t,y,u)  G  (Ea)*  exists  and 

(i0)  For  every  t,  u  fixed  fo(t,y,u)  and  dyf0(t,y,u )  are  continuous  in  y  G  Ea, 

(iio)  For  every  u(-)  G  Cad(0,T-,U)  and  y  G  Ea  fixed  t  — >  fo(t,y,u(t))  and  t  — > 
dvfo(t,y,u(t))  are  strongly  measurable  in  their  home  spaces, 

(m0)  For  every  u(-)  G  Cad(0,T ;  U)  there  exist  constants  K0(c),  L0(c)  such  that 

\fo{t,y,u(t))\  <  K0(c)\\u(t)\\p,  \\dyfo(t,y,u(t))\\E_a<L0(c)\\u(t)\\p 

(2'8)  (0<t<T,|Mk<c). 

3.  The  minimum  principle 

For  definiteness,  we  limit  ourselves  to  the  model  (2.1)  under  Hypothesis  The 
admissible  control  space  Cad(0,T\  U )  is  equipped  with  the  distance 

(3.1)  <*(«(•).«(•))  =  10  G  [0,T];«(t)  +  w(*)}|, 

(j  •  |  =  Lebesgue  measure).  In  general,  Cad(0,T\U)  is  not  complete  under  d  if  the 
control  set  U  is  unbounded;  for  instance,  consider  U  a  cone  ^  {0}  in  X*,  u  G  U, 
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un(t )  =  where  x„(-)  is  the  characteristic  function  of  [1  /n,T\.  This  is  the 

key  difficulty  precluding  direct  application  of  the  methods  in  [5] . 

The  dual  space  ( Ea )*  is  given  by  ( Ea )*  =  (E*)-a  with  pairing 

(3.2)  {z,y)(E*)-axEa  =  {{-A*Yaz,(-A)ay)E*xE- 

The  proof  follows  from  (2.3);  see  [5,  Lemma  4.4]  for  details  of  a  more  general  result 
(actually  needed  in  §5). 

Given  any  Banach  space  E,  the  space  £(0,  T;  E *)  consists  of  all  countable  additive 
bounded  E*  -valued  measures  fi(ds)  defined  in  the  field  generated  by  the  closed 
sets  of  [0,T];  this  space  is  the  dual  of  (7(0,  T;  E),  the  duality  given  by  {fi,y)c  = 
Jq  (p(ds),y(s)).  Since  ( Ea )*  =  (£*)-«  we  have  £(0,T;  (£*)_„)  =  £(0 ,T-(Ea)*)  = 
<7(0,7";  Ea)*.  See  [1]  for  further  details  on  vector— valued  measures. 

Let  F  be  a  Banach  space.  We  call  a  sequence  {Qn},  Qn  €  F  precompact  if  every 
sequence  {qn},'Qn  G  Qn  has  a  convergent  subsequence.  A  closed  set  Z  C  F  is  T—full 
at  x  £  Z  if,  for  every  sequence  {xn}  C  Z  such  that  xn  — >  x  there  exists  p  >  0  and  a 
precompact  sequence  { Qn },  Qn  C  F  such  that  the  sets  Tz(xn)  G  B( 0,  p)  +  Qn  contain 
a  common  ball  B(0,  e)  for  n0  large  enough  (Tz(x)  is  the  Clarke  tangent  cone  to  Z 
at  x).  If  the  condition  above  is  satisfied  with  Qn  =  (0),  Z  is  strongly  T—full  at  x. 
Finally,  if  is  T-full  (resp.  strongly  T-full)  if  it  is  T-full  (resp.  strongly  T-  full)  at 
every  x  e  Z.  Examples  of  strongly  T-full  sets  are  closed  convex  sets  with  nonempty 
interior;  for  other  examples  see  [6] . 

Given  a  sequence  { Zn }  of  subsets  of  F,  lim  infn^oo  Zn  is  the  set  of  all  z  =  limbec  zn 
with  zn  €  Zn.  If  Z  C  F,  Z~  C  F*  is  the  polar  cone  of  all  z*  e  F*  with  {z*,z)  <  0 
(z  e  Z).  Finally,  we  define 

M (t)  =  {2/(-)  €  C(0,t;Ea);  y(t)  6  M  (0  <  t  <  i)}. 

and  assume  that  M (t)  is  strongly  T-full  in  (7(0,  t;  Ea )  and  that  Y  is  strongly  T-full 
in  Ea. 

Theorem  3.1  below  for  (2.1)  gives  necessary  conditions  for  optimality  of  a  control 
u(-).  We  assume  that  the  set  of  Lebesgue  points  of  all  functions  ( v  e  U ) 

has  full  measure  in  [0,  T]. 

Theorem  3.1.  Let  u(-)  €  Cad(0,t',U)  be  an  optimal  control  in  0_<  t  <  t.  Then 
there  exists  a  double  sequence  {(?/„(')> i/m)}  £  M(f)  xYC  C(0,t',Ea)  x  Ea  such 
that  ($£(■),  Vm)  (y(su),y{t,u))  as  n  -►  oo  for  m  =  1, 2, . . .  and  a  multiplier 
{z0,p,,z)  Glx  £(0,t;  ( Ea )*)  x  ( Ea )*,  ( z0,p,,z )  ^  0,  satisfying 

(3.3)  zo>0,  pe  (lminf  liminfTM(,)(?/"(-)))  ,  26  (liminf  lim  inf  TY{ynS)  , 
and  such  that 

(3.4)  z0{fo{s,y(s,u),v)-fo{s,y(s,u),u(s))}  +  (B*z{s),v-u(s))  >  0  (v  €  U) 
a.e.  in  0  <t  <t,  where  z{s )  is  the  solution  of 

dz(s)  =  ~  {A*  +dyf(s,y(s,u))*}z{s)ds 

-  z0dyfo(s ,  y(s,  u),  u(s))ds  -  p(ds),  z(t)  =  z. 


(3.5) 
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By  definition,  z(s)  =  (~A*)  pv(s),  where 
v(s)  =  Rafi(i,s)*((-A)-ayz 

(3.6) 


+ 


j  Ra,o(<J,s)*((-A)  a)*fi(da)  =vh(s) +  Vi{s), 


and  the  operator  Ra,o(t,  s)  is  defined  by  the  integral  equation 
Rafi(t,s)(=(-A)aS(t-s)t 

(3'7^  +  £  (-A)aS{t-T)dyf(T,y(T,u))(-A)-aRafi(T,s)(dT. 

(the  operator  (- A)~aRafi(t ,  s)  is  the  propagator  or  solution  operator  of  the  variational 
equation  £'(t)  =  {A  +  dyf(t,y(t,u))}£(t)).  Finally,  Vi(t)  in  (3.6)  is  understood  as 
follows:  for  each  s,  Vi(s)  is  the  only  element  of  E*  satisfying 

(3.8)  (y,  Vi(s))  =  ^  (Ra,o(v,  s)y,  ((-A)-a)*p(da))  (y  €  E). 

Using  the  integral  equation  (3.7)  we  obtain 

(3.9)  \Ra,o{t^)\\{E,E)<C(t-s)-a  (0  <s<t<t) 

so  that 

(3.10)  \\vi{s)\\E  <  Ctoa{s)  =  C  j\t  -  a)-aM(da)  (0  <s<t<t). 

Integrating  in  0  <  s  <  t  and  using  Tonelli’s  theorem  we  deduce  that  wa(-)  G  L ^O,  t ). 
It  follows  from  its  definition  that  u(-)  is  E-weakly  measurable,  thus  is  strongly 
measurable  by  separability. 

Here  is  the  strategy  for  the  proof.  Let 

(3.11)  Cad(0,T;U,u)m  =  {«(•)  e  Cad(0,T-U)-,  \\u(a)  -  u(a)\\  <  m} 

for  m  =  1,2, _ This  subspace  is  complete  in  the  distance  (3.1);  moreover  it  is 

patch  complete  in  the  sense  of  [4],  [5]:  if  u(-),v(-)  G  Cad(0,  T;  U,  u)m  and  e  C  [0,  T] 
is  measurable,  the  control  equal  to  u(t)  in  e  and  to  v(t)  outside  of  e  belongs  to 
Cad(Q,T;U,u)m.  Finally,  it  follows  from  (2.4)  and  (2.8)  that 

(3.12)  \\f(t,y)\\  +  ||B«(t)||  <  K(c)  +  C(\\u(t)\\  +  m),  \\dyf(t,y)\\  <  L(c) 

(3.13)  \f0(t,  y,  u(t))\ <  K0(c)\\u(t)  +  m||p,  \dyf0(t,y,u{t))\<L0(c)\\u(t)  +m\\p 

all  bounds  valid  for  0  <t  <T,  \\y\\sa  <  c  independently  of  u(-)  G  Cad{0,T-,  U,u)m. 
Taking  into  account  that  if  u{- )  is  optimal  in  Cad{ 0,  t;  U)  it  is  optimal  in  any  sub¬ 
space  that  hosts  it,  in  particular  in  Cad(0,i-,U,u)m,  we  can  apply  [5,  Theorem  6.2] 
and  deduce  that  for  each  m  there  exists  a  sequence  {(?/„(')> 2/m)}  5=  M(f)  x  Y 
with  (y”(-),jfm)  -»  (y{-,u),y(t,u))  as  n  ->  oo  and  a  multiplier  (^.MmUm)  € 
M  x  E(0 ,f;  (Ea)*)  x  {Ea)*,  {z0m,Hm,  zm)  ±  0  with 

(3.14)  2:0m>0,  G  (lirninfTM(t)(jf”(-)))  ,  G  (lirnmf  Ty(y^ 
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and  such  that 


z0 mJo  {fo(<r,y{<r,u),v((r))  -  f0(a,y(cr,u),u(s))}da 


+  J  (B*zm(a),v(a)  -  u(a)}da  >  0 

for  every  v(-)  G  Cad( 0,  f;  U;  u)m,  f  where  zm(s)  is  the  solution  of 
dzm(s)  =  -{A*  +  dyf{s,  y(s,  u))*}zm(s)ds 
^  -  zodyfo(s,y(s,u),u(s))ds- Hm(ds),  z(t)  =  zm. 

It  is  then  clear  that,  in  order  to  prove  Theorem  3.1  it  is  enough  to  show:  (a)  if 
necessary  passing  to  a  subsequence 


(3.17)  (^O 772  5  -^m)  *  (^O)  ^0  7^ 

(K  x  C(0,i;£o)  x  £a)-weakly  in  E  X  E(0,  t;  (Ea)*)  x  (Ea)*,  and  (6)  If  f(s)  (resp. 
zm(s))  is  the  solution  of  (3.5)  (resp.  (3.16))  then 

(3.18)  zm{-)  ->  z(-) 

L°°(0,i;E) -weakly  in  Ll(Q,t\E*).  In  fact,  if  both  (3.17)  and  (3.18)  are  satisfied,  we 
take  limits  in  (3.15)  and  obtain 


zojo  {fo{e,y(e,u),v(a))  -  f0(a,y(a,u),u(s))}c 


J  (B*z(a),v{a)  —  u(a)^da  >  0 


for  all  v(-)  €  Cad(0,t;  U )  with  v{a)  -  u(a)  bounded,  and  here  is  how  we  obtain  the 
pointwise  version  (3.4)  of  the  maximum  principle.  Let  m  =  1,2,...,  and  define 
dm  =  {a  G  [0,t];m  <  ||fZ(cr)||  <  m+  1},  Xm  the  characteristic  function  of  dm, 
um(cr)  =  Xm{o)u{o),  em  the  set  of  all  left  Lebesgue  points  of  both  functions  xm(cr) 
and  um(a)  in  [0,  t ];  em  has  full  measure  in  [0,t],  so  that  em  fl  dm  has  full  measure  in 
dm.  If  s  G  em  fl  dm  then  there  exists  a  set  em(h )  C  [s  -  h,  s]  D  dm  such  that 


(3.20)  \\em{h)\  — >  1,  ^  f  \\u(a)  -  u(s)\\da  ^  0  ash-^0  +  . 

fl  fl  Jem\h) 

The  set  U  (em  n  dm)  is  total  in  [0,  t],  thus  so  is  its  intersection  e  with  the  set  of  left 
Lebesgue  points  of  the  following  functions:  (i)  z(-),  (ii)  fo{-,y(-,u),u(-)),  (Hi)  all 
functions  /□(-,  y(-,  u),v).  Take  s  G  e  and  v  G  U,  find  m  so  that  s  G  em  fl  dm  and  stick 
in  (3.19)  the  function  v(a)  =  uxm(c)  +  (l— Xm(v))u(cr).  Then  v(cr)—u(a)  =  vxm(&)- 
Xm(o)u(o)  is  bounded.  We  take  limits  using  (3.20)  and  the  diverse  assumptions  on 
Lebesgue  points,  and  obtain  (3.4). 


t Strictly  speaking,  Theorem  6.2  is  proved  in  [5]  under  the  assumption  that  the  bounds  in  (3.12) 
and  (3.13)  are  uniform,  that  is,  that  the  functions  on  the  left  are  bounded  by  constants  independent 
of  the  controls.  The  proof  extends  almost  without  changes  to  this  more  general  situation  due  to 
the  fact  that  (3.12)  and  (3.13)  are  independent  of  the  particular  control  «(■)  G  Cad(0,T;U,u)m. 
Boundedness  of  v(o)  -  u(a)  makes  it  possible  the  interchange  in  the  order  of  integration  at  the  end 
of  Theorem  6.2. 
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Back  to  (3.17)  and  (3.18).  The  required  convergence  in  (3.17)  can  be  attained 
invoking  Alaoglu’s  Theorem.  To  show  that  the  limit  is  nonzero  note  that,  since 
(zom,  [Mm  Zm)  ^  0  we  may  assume  that 

(3-21)  \zQm\  +  IlMmllj^o, ?;(£;„)*)  T  H^mll^)*  =  1- 

If  zom  — >  z0  ^  0  there  is  nothing  to  prove.  If  z0  =  0  we  may  erase  zom  from  the  left 
side  of  (3.21)  replacing  “  =  ”  by  “  — >  ”.  We  obtain  from  (3.14)  that 

(3.22)  (/tm  2/m(‘))^,c  d-  (Zrm  l/m)a  0 
for  (ym{-),ym)  E  Am  =  Mm  x  ym,  where 

(3.23)  Mm  =  lirn  inf  TM(i)(y”  (■))  C  <7(0, f;  Ea),  ym  =  lirninf  Tr(ji" )  C  Ea, 

where  ( • ,  •  )a  indicates  the  duality  of  Ea  and  (£“)_«  and  ( ■ ,  •  )C:Ct  the  duality  of 
(7(0,?;  Ea)  and  E(0 ,  f;  ( E*)_a ).  The  assumptions  on  M(f)  and  Y  and  [6,  Lemma  3.6] 
guarantee  that  the  Mm  x  ym  contain  a  common  ball  in  Ea  for  m  large  enough,  so 
the  fact  that  (p,  z)  ^  0  follows  from  the  result  below  ([2,  Lemma  2.5]). 

Lemma  3.2.  Let  F  be  a  Banach  space,  {Am}  a  sequence  of  subsets  of  F,  { zm }  a 
sequence  in  F*  such  that 

0<C<  ||zm||F*  ^  (7  <  OO,  (Zm,y)  Ll  ~~Em  *  3  (jj  ^  Am). 

Assume  there  exists  a  precompact  sequence  {Qm}  in  F  such  that  the  sets  conv( Am)  + 
Qm  contain  a  common  ball,  (conv  =  dosed  convex  hull).  Then  every  F— weakly 
convergent  subsequence  of  {zm}  has  a  nonzero  limit. 

It  only  remains  to  show  (3.18).  Writing  vm(s)  =  vmh(s )  +  vmi(s)  as  in  (3.6),  the 
statement  is  obvious  for  the  homogeneous  parts;  in  fact, 

Vmh(s)  ->  vh(s)  0  <s<t, 

E—  weakly  in  E*,  so  that  (3.18)  follows  from  the  dominated  convergence  theorem. 
To  show  (3.18)  for  vmi(s)  we  use  (3.8).  Let  vm{d<j)  =  ((—A)~a)*pm(do),  iz(da)  = 
((— A)~a)*  p(do) .  Then  vm  — *  v  (7(0,  f;  i?)— weakly  in  E(0,  t;  E*).  Given  n  =  1, 2, . . . 
define  an  operator  Ra,P,n{t,  s )  in  the  square  [0,  t]  x  [0,  t)  by  Ra,p,n(f>  s)  =  Ra,P{t,  s)  in 
t—s  >  l/n,  Ra:Pln{t,  s)  =  0  in  t  <  s,  and  extend  Ra,P,n{t ,  s)  linearly  to  0  <  t— s  <  1/n, 
so  that  it  shares  the  bound  (3.9).  Define  vf(s)  with  v(ds)  in  the  same  way  as  vf  s) 
but  using  Ra,P,n(t,s)  instead  of  RaiP(t,s)-,  likewise,  define  v^f  s)  using  vm{ds)  and 
Ra,P,n(t,s).  We  have 

(3.24)  II Vi(s)  -  u”(s)||b.  <  Cpn(s)  =  <7  /  (a  -  s)~a\\v\\(do)  . 

Js 

By  Tonelli’s  theorem, 

(3.25)  f  pn(s)ds<  (  \\i/\\(da)  [  (o  -  s)~ads=  — 1 —  [  \\v\\ (do), 

JO  Jo  Ja—lfn  1  —  OL  \Tl /  J 0 

so  that  ||/9n(-)IU1(o,t)  — >  0  as  n  — >  oo.  Since  {vm(ds)}  is  bounded  in  E(0 ,t\E*)  we 
obtain  in  the  same  way  that 

(3.26)  ||i)mi(s)  -  ^mi(s)ll£*  <  Cpmn(s) 
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for  each  m,  with  ||/w(-)IUw)  ->  0  as  n  ->  oo,  uniformly  with  respect  to  m.  Pick 
/(•)  G  L°°(0 ,t;E)  and  write 

(3.27)  (vi{s) -vmi(s),f(s)) 

=  (vi(s)-vUs),m)  +  ms)-vnmi(s)J(s))  +  <*&(«)  -vmi(s),f(s))  . 

We  have  that  \(vi(s)  -  v?(s),f(s))\  <  \\f\\L°°(p,i;E)  Pn(s)  with  a  similar  estimate  for 
l(^mi(s)  ~  Wi*(s)>/(s))l>  thus  the  first  and  third  terms  on  the  right  of  (3.27)  are 
disposed  of.  For  the  middle  term  we  note  that 

J*  (v  i(s)  -  v^iis),  f{s))ds  =  Jo(^Jo  Ra,o,n(°>  s)f(s)ds>  (y  ~  OW|  . 

hence  it  is  enough  to  show  that  the  function  in  the  left  side  of  the  angled  brackets 
inside  the  integral  on  the  right  is  continuous.  This  is  obvious  and  left  to  the  reader. 

The  first  relation  (3.3)  is  plain;  the  third  relation  follows  taking  limits  in  (zm,  ym)  < 
0  (ym  G  TyiVm )),  and  the  second  results  in  the  same  way. 

4.  The  point  target  case 

There  is  a  version  of  the  maximum  principle  for  (2.1)  [5,  Theorem  9.1]  that  covers 
point  targets.  It  requires  that  /  :  [0,  T]  x  Ea  — >  Es  (a,  6  >  0)  and  that  (— A)sf(t,y ) 
satisfy  Hypothesis  Dafi ■  The  proof  requires  again  M  C  Ea  closed  and  M(t)  T— full 
in  C(0,t;Ea),  but  we  only  need  Y  to  be  a  closed  subset  of  E\  —  D(A ),  thus,  in 
particular  we  may  handle  the  point  target  ^condition  y(t,u)  =  y  G  D(A).  The  control 
space  in  the  present  application  is  C(0,t;U,u)m ,  and  the  multiplier  (zom,ym,zm) 
belongs  to  RxE(0,f;  {Ea)*)  x  (E*)1^,  where  (£l*)Li  is  the  subspace  of  (£'*)_ i  =  (Ei)* 
determined  by  the  condition 

(4.1)  £  \\B*S(t)*z\\xdt  <  oo  , 
and  we  have 

(4.2)  z0m>  0,  Mm  G  (lirninfTM(t-)(C(-)))  ,  G  (lirninf  Ty(y"))  , 

where,  for  each  m,  {($),(■), £m)}  ^  M(*)  x  Y  witl1  C ym(')’Vm )  (y(-,u),y(i,u))  as 
n  -+  oo.  See  [6]  on  extension  of  the  adjoint  semigroup  S(t)*  to  the  spaces  (£l*)_i 
and  other  details.*  The  costate  is  defined  as  in  (3.6)  but  the  homogeneous  term 
is  Vmh(s)  =  ((- A)1-aRai0(i,s;u))*{-Ay1)*zm .  The  minimum  principle  in  integral 
form  is  again  (3.15), 

(4.3)  z0mjo  {fo(^,y^,u),v(a))  -  f0((T,y(<T,u),u{s))}da 

+  J  (B*Zm{c r),  v(a)  -  u(o))d<7  >  0 

*Again,  Theorem  9.1  in  [5]  is  proved  assuming  that  the  bounds  in  (3.12)  and  (3.13)  are  uniform 
and  the  controls  are  bounded.  The  proof  works  under  (3.12)  and  (3.13)  as  long  as  v(a)  -  u(o)  is 
bounded. 
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for  every  v(-)  £  Cad( 0,  t;  U ;  u)m  and 

(4.4)  [  S(t  —  a)Bv{a)da  £  D(A)  . 

Jo 

(note  that,  since  y(t,u)  £Y  C  D(A),  u(-)  itself  satisfies  (4.3)).  The  conditions  above 
do  not  guarantee  that  the  multiplier  is  nonzero.  A  sufficient  condition  comes  below. 
Given  v(- ),  £  Cad( 0,  t;  U),  denote  by  £(t,  u ,  v )  the  solution  £(f)  of 

(4.5)  ?(t)  =  {A  +  dyf(t,y(t,u))}Z{t)  +  B(v(t)  -u{t)),  £(0)  =  0 

and  assume  the  reachable  space  R(t,U;u)m  =  {£(i,u,v);v  £  (7(0,  i;  U,u)m}  satisfies 

(4.6)  R(i,U;u)x  D  B^s), 

Bi(e')  a  ball  of  radius  e  >  0  in  E1.  Then 


(4.7)  (z@ m,  Pui t  Zm)  7^  0. 

When  controls  u(-)  are  bounded,  a  sufficient  condition  for  (4.6)  is  that  U  contain  a 
ball  of  positive  radius  around  some  control  u(-)  satisfying  (4.4).  In  the  general  case, 
it  is  enough  to  require  that  the  set 

(4.8)  |  S(t  —  a)B(v(a)  -  u(a))do\  v(-)  £  Cad( 0,  t;  U,  u)i 


contain  a  ball  Bi(e)  C  Ex.  This  does  not  seem  like  a  good  condition  to  verify  since 
it  contains  the  unknown  optimal  control,  but  it  can  be  checked  easily  in  many  cases 
without  any  information  on  u\  one  example  is  X*  =  L°°(Q),  U  defined  by  the  condi¬ 
tion  u(x)  >  0  a.e.  (see  Section  6). 

We  shall  take  limits  in  (4.3)  in  the  same  way  as  in  (3.15),  and  we  need  to  show 
(3.17)  and  (3.18).  Note  that,  this  time,  (z0m,  fj.m,  zm)  €ix  £(0  ,  (£*)_„)  x  (E*)-x 
and  convergence  in  (3.17)  is  (K  x  Ea  x  Sf)— weak  convergence. 

Inequality  (4.3)  is  equivalent  to 


(4.9)  Z0mto{t,U,v)  +  {pm,^{-,U,v))atC+  {zm,^(t,U,v))x  >  0, 

for  all  v(-)  £  Cad(0,t-,U,u)m  with  £(t,u,v)  £  Ex  (the  latter  condition  equivalent  to 
(4.4)),  where 

& (t,u,v)  =  [  (dyfo{T,y{T,u),u{T)),t(T,u,v))  dr 

(4.10)  Jo  ;  “ 

+  Jo  \fo(r,y{T,u),v(T))  -  /o(r,2/(r,u),u(r))|dr; 

(see  [5]  for  a  proof  of  the  equivalence  of  (4.3)  and  (4.9)).  Since  {z0m,  pm,  zm)  ^  0  we 
may  assume 


(4.11)  4n  +  llMm|||(0,  *,(£„)*)  +  llZmll  (Si)*  ~  1  (m  —  1,  2,  .  .  .  ). 

Select  a  subsequence  of  the  sequence  {(z0m,  pm,  zm)}  such  that  the  limit  (3.17)  exists 
(M  x  Ea  x  Si)— weakly.  If  zom  — >  z0  ^  0,  the  limit  is  nonzero  and  there  is  nothing 
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to  prove.  If  zq  =  0,  we  combine  (4.2)  with  (4.9)  keeping  v(-)  in  Cad(0,t;U,u)i  C 
Cad(0,t]  U,u)m.  The  result  is 

(4.12)  (Pmi  £(■)  v)  ym('))a,c  T  (Zmi  U,  V)  Urn)  1  —  &m  >  0 

for  (ym(-),ym)  6  Mm  X  ym,  Mm  and  ym  given  by  (3.23),  the  second  liminf  (and 
tangent  cones)  computed  in  the  norm  of  Ex.  The  expression  on  the  right  of  (4.12) 
is  justified  by  the  fact  that  ||u((r)  —  fx(cr ) ||  <  1,  so  that  £(t,u,v),  hence  £o(L  u,  v), 
is  bounded  independently  of  u(-)  (take  a  look  at  (4.5)  and  (4.10)).  Accordingly,  to 
apply  Lemma  3.2  it  is  enough  to  show  that  the  sets  Am  defined  by 

(4.13)  (£(-,u,u)  -ym(-),  £(t,u,v)  -ym)  C  C(0,t;Ea)  x  Ex 

(where  «(■)  G  C„d(0,i]U,u)i  and  we  may  take  ym  =  0)  contain  a  common  ball  in 
(7(0,  f;  Ea )  x  Ei.  The  first  coordinate  is  covered  by  ym{-)  alone  on  the  strength  of  the 
assumptions  on  M(f)  and  of  the  fact  that  £(•,«,  w)  is  bounded  in  (7(0,  f;  Ea)\  for  the 
second,  we  use  (4.6).  For  reference,  we  state  the  final  result: 

Theorem  4.1.  Let  u(-)  G  Cad(0,  t;  U )  be  an  optimal  control.  Then  there  exists  a 
double  sequence  {(jf^-),  i/” )}  C  M(t)  xh  C  (7(0, f;  Ea)  x  E\  such  that  (y” ,(•), y” )  — »• 
(y(-,u),y(t,u))  in  C(0,i',Ea)  x  Ex  as  n  — >  oo  and  a  multiplier  (. z0,(J.,z )  G  M  x 
E(0,t\(E*).a)  x  (E*)- 1,  (zo,n,z)  ±  0,  satisfying 

(4.14)  20>0,  mG  (liminf  lhninf  TM(t-}(y"  (•)))  ,  ^G  (liminf  lirn inf 
and 

(4.15)  zo{fo(s,y(s,u),v)-fo(s,y{s,u),u{s))}+(B*z(s),v-u(s))>0  (v  G  U) 
a.  e.  in  0  <t<t,  where  z(s)  is  the  solution  of  (3.5). 

Remark  4.2.  The  integral  form  (3.19)  of  the  maximum  principle  guarantees  that 
z  G  (E*)f i;  the  argument  is  the  same  in  [5,  Theorem  9.1]. 

5.  Nonreflexive  spaces 

The  setup  in  §4  covers  reaction— diffusion  equations  and  the  Navier— Stokes  equa¬ 
tions  in  IP  spaces  1  <  p  <  oo  (see  §6);  however,  there  is  an  advantage  in  treating 
parabolic  equations  in  spaces  C(O)  of  continuous  functions  (see  §6),  thus  it  is  con¬ 
venient  to  extend  the  results  in  §4  to  nonreflexive  spaces.  This  can  be  done  with 
minor  changes;  we  only  need  to  assume  that  E  is  separable.  The  assumptions  on  the 
semigroup  S(t )  and  its  infinitesimal  generator  A  are  the  same  and  the  spaces  Ea  are 
defined  in  the  same  way.  On  the  other  hand,  A*  may  not  be  a  semigroup  generator 
(or  even  densely  defined)  thus  the  spaces  ( E*)a  are  defined  using  ((— A)a)*  rather 
than  (~A*)a.  If  D(A*)  is  not  dense  in  E*,  it  is  no  longer  true  that  (Ea)*  =  [E*)-a] 
however,  the  dual  is  algebraically  and  metrically  isomorphic  under  (3.2)  to  a  larger 
space  (Ea)*  with  (£!*)_„  ^  (Ea)*  ^  (E*)^a+e)  for  all  £  >  0.  Some  of  the  functions 
of  the  dual  E*  are  taken  over  by  the  Phillips  dual  E@  =  closure  of  D(A*)  in  E*\  the 
adjoint  semigroup  S(t)*,  restricted  to  Ee  is  called  Se(t)  and  is  strongly  continuous; 
the  space  Ee  is  maximal  with  respect  to  this  property.  We  name  A&  the  infinitesimal 
generator  of  Se(t).  The  same  considerations  applied  to  the  semigroup  Se(t)  produce 
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(i?®)®,  (A®)®  and  ( S(t)@)G .  We  assume  that  E  is  ©— reflexive  in  the  sense  that 
(Eq)@  =  E;  this  implies  that  (A®)®  =  A  and  (S'(t)®)®  =  S(t).  Hypothesis  Da)P  is 
formulated  in  the  same  way,  but  in  the  case  p  =  0,  f{t,y )  is  allowed  to  take  values 
in  (EQ)*  D  E. 

The  statement  and  proof  of  the  minimum  principle  are  the  same;  the  multiplier 
(z0,  /b  z)  is  in  R  x  E(0,  f ;  ( Ea )*)  x  ( Ea )*  (M  x  E(0,  f ;  (Ua)*)  x  ( Ex )*  for  point  targets). 
We  still  have  ((— A)~a)*{Ea)*  =  E*,  so  that  the  costate  z(s)  is  defined  in  the  same 
way  as  in  the  reflexive  case;  due  to  the  smoothing  properties  of  the  semigroup,  z(s) 
takes  values  in  EQ  rather  than  E* . 

6.  Applications 

Theorems  3.1  and  4.1  for  reflexive  separable  E  can  be  applied  to  the  controlled 
Navier— Stokes  equations 

(6.1)  y'(t)  =  Apy(t)  +  Pp(( y  •  V)y  +  Ipu(tfj  . 

The  space  is  E  =  Xp(Q.)m  =  closure  in  Lp(Q)m  of  all  divergence-free  m-vectors 
y(x)  =  (yi(x), . . .  ,ym(x));  Pv  is  the  projection  of  Z/(Q)m  into  Xp(^l)m  and  Ap  is 
the  Stokes  operator  Ap  —  PpAp,  Ap  the  m-vector  Laplacian  in  Lp{yi)m.  The  control 
space  is  a  (possibly  unbounded)  subset  of  1/(0,  T;  Z7(fi)m),  p  <  r  (for  r  =  oo  we 
take  L“(0,  T;  L°°(Q)m ))  and  Ip  is  the  embedding  operator  from  Lr( fi)m  into  Z/(fi)m. 
Hypothesis  is  satisfied  for  a  <  1  if  p  >  m  and  1/2  +  m/2p  <  a;  in  Theorem 
3.1  one  may  take  r  >  p  arbitrary,  in  particular  r  =  oo.  On  the  other  hand,  the 
assumptions  in  Theorem  4.1  hold  for  p  large  enough  if  1/2  <  a  <  1,  S  <  a  -  1/2);  in 
this  case,  r  =  p.  The  treatment  admits  state  constraints  of  the  form 

(6.2)  Sy(t,x)  €  Ms  Cl‘  (0  <  t  <  t, x  G  fi), 

( Sy(x )  =  ET,r]jk(x)djyk(x)  +  T,rjj(x)yj(x)  a  first  order  differential  operator  with 
k— vector  coefficients  in  C(Cl)k)  and  target  conditions  of  the  same  type.  Nonlin¬ 
earities  more  general  than  the  one  in  (6.1)  are  tractable.  See  [7]  for  details. 

The  main  application  in  nonreflexive  spaces  is  to  uniformly  elliptic  partial  differ¬ 
ential  operators  A(f3)  coupled  with  a  boundary  condition  (3  in  a  bounded  domain 
G  Mm  with  boundary  T.  Here,  E  =  C(Q)  =  {all  continuous  functions  in  12} 
equipped  with  the  supremum  norm;  for  the  Dirichlet  boundary  condition  the  space  is 
the  subspace  Co(fi)  of  C(H)  determined  by  y(x)  —  0  {x  £  T).  Assuming  smoothness 
of  the  coefficients,  the  domain  and  the  nonlinearity  f{t,y)(x)  =  (f>(t,x,y(x),Xy(x)) 
all  results  apply  with  a  >  1/2,  and  state  and  target  constraints  of  the  form  (6.2)  are 
tractable.  See  [5]  for  further  details. 
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Abstract.  In  this  paper,  we  analyze  the  approximate  controllability  and  the  exact-to- 
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1.  Introduction 

During  the  last  years,  controllability  for  deterministic  distributed  parameter  sys¬ 
tems  has  been  intensively  studied.  In  particular,  it  is  known  that  the  heat  equation 
and  the  Stokes  system  with  control  concentrated  in  an  arbitrary  subdomain  are  ap¬ 
proximately  controllable  (see  [11];  see  also  [5]). 

However,  the  analysis  of  the  controllability  for  stochastic  partial  differential  equa¬ 
tions  seems  to  remain  an  almost  open  field  of  research.  The  unique  works  we  know 
on  the  subject  are  [3]  and  [15]. 

In  this  paper,  we  present  some  approximate  and  exact-to-zero  controllability  re¬ 
sults  for  a  class  of  linear  stochastic  partial  differential  systems.  This  includes,  as  a 
particular  case,  a  stochastic  heat  equation  of  the  form 

(  dty  —  A y  =  lov  -I-  B(t)  wt  in  Q  =  D  x  (0,  T)  P  —  a.s., 

|  y  =  0  on  E  =  9ff  x  (0,T)  P  —  a.s., 

1  2/(0)  =  2/o  in  D  P  -  a.s., 

where  the  control  is  v.  Here,  O  and  D  are  bounded  open  sets  in  RN  with  O  C  D,  l0 
is  the  characteristic  function  of  the  set  O  and  wt  =  dtwt  is  a  Gaussian  random  field, 
white  noise  in  time. 

Roughly  speaking,  we  are  going  to  prove  that,  for  general  yo,  yd  and  B,  one  can 
obtain  final  states  y(T)  arbitrarily  close  to  ya  in  quadratic  mean  by  choosing  v  ap¬ 
propriately  (an  approximate  controllability  result).  We  will  also  prove  that,  if  B  is 
not  random  and  in  some  sense  small,  then  one  can  also  choose  v  such  that  y(T)  =  0 
(a  null-controllability  result). 
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We  will  also  study  questions  of  the  same  kind  for  Stokes  and  (more  generally) 
quasi-Stokes  stochastic  systems 


(1.1) 


'  dty3  -  A yj  -  di  ( ai(x)yj )  +  cij(x)yi  +  <9,11  =  loVj  +  Bj(t)  wt, 
Vj  = 

Vj  =  o  on  £, 

.  %(0)  =  Voj  in  D 


(1  <  j  <  N),  where  the  usual  summation  convention  is  assumed. 

This  work  is  a  continuation  of  [15].  An  extended  version,  where  other  more  general 
problems  will  be  considered,  will  appear  in  the  next  future. 


2.  Approximate  controllability  results 

Assume  a  bounded  and  connected  open  set  D  C  RN  with  regular  boundary  c)D, 
a  nonempty  open  subset  O  C  D,  a  positive  number  T  and  a  complete  probability 
space  (fl,  fF,  P}  are  given. 

We  will  use  the  following  notation:  H  =  L2(D),  V  =  Hq(D),  |.|  and  (., .)  are 
resp.  the  usual  norm  and  scalar  product  in  H.  If  X  is  a  Banach  space  and  /  e 
Ll(Q,p-,  X ),  we  denote  by  Ef  the  expectation  of  /: 

Ef=  [  /(w)  dP(w). 

Jn 

Also,  assume  that  a  separable  Hilbert  space  K  and  a  Wiener  process  wt  on  {fI.A,P} 
with  values  in  K  are  given.  This  means  that 

OO 

wt  =  Y,tfek  Vt>  0, 

fc= i 

where  the  $  are  mutually  independent  real  Wiener  processes  satisfying 

2  00 

(2.1)  =Xkt,  ]TAfc<+oo 

k= 1 

and  {ek}  is  an  orthonormal  basis  in  K  (for  the  definition  of  a  real  Wiener  process,  see 
for  example  [1]).  Notice  that,  in  particular,  wt  has  Holder-continuous  sample  paths. 

In  the  sequel,  we  put  Tt  =  a(ws,  0  <  s  <  t)  (the  cr-algebra  spanned  by  ws  for 
0  <  s  <  t).  Obviously,  {Pt}  is  an  increasing  family  of  sub  cr-algebras  of  T  and, 
among  other  things,  one  has: 


For  any  /  e  Ll{Cl,P\  H),  we  denote  by  E[f  \Pt]  the  conditional  expectation  of  / 
with  respect  to  Pt,  i.e.  the  unique  element  in  19(0, H)  such  that 

f  E[f\Pt}dP=  [  fdP  ffAcPt 
J  A  J  A 

(cf.  [14]  for  the  main  properties  of  the  conditional  expectation). 
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Let  X  be  a  Banach  space.  We  denote  by  J2(0,T;  X)  the  space  formed  by  all 
stochastic  processes  $  G  L2(fl  x  (0 ,T),dP  ®  dt\  X)  which  are  ^.-adapted  a.e.  in 
(0,  T),  i.e.  such  that 

$(f)  is  .^-measurable  for  almost  all  t  G  (0 ,  T) 

(in  the  case  X  —  C(K\  H ),  measurability  will  mean  strong  measurability).  Then, 
72(0,  T;  X)  is  a  closed  subspace  of  L2(Q  x  (0,  T),  dP  ®  dt;  X). 

Assume  that  a  stochastic  process  B  is  given,  with 

(2.3)  Bg72(0,T;£(7L;77)). 

Then  the  stochastic  integral  of  B  with  respect  to  wt  is  defined  by  the  formula 

ft  °°  ft 

/  B(s)  dws  =  Y,  5(s)efc  d(3ks  Vi  G  [0,  T], 

J0  do 

Here,  the  convergence  of  the  series  is  understood  in  the  sense  of  L2(f2,  H).  The 

stochastic  integrals  in  the  right  side  are  defined  by  the  equalities 

(jf*  B(s)ek  dpi  h)  =  j\B(s)ek,  h)  d%  VheH, 

where  the  latter  are  usual  Ito  stochastic  integrals  with  respect  to  the  real-valued 
processes  $  (see  [1]). 

Assume  we  are  given  an  arbitrary  but  fixed  initial  state 


(2.4)  y0  G  H 

and  set  A  =  A  (the  usual  Laplace  operator).  For  each  v  G  72(0, T;  77),  there  exists 
exactly  one  solution  yv  to  the  problem 


f  Vv  G  72(0,  T;  V)  0  L2(U;  C°([0,T];  H )), 

|  Vv(t)  =  Vo  +  JQ  {Ayv(s)  +  l0n(s)}  ds  +  B{s)  dws  Vte[0,T]. 

In  (2.5),  the  equalities  hold  P  -  a.s  in  V' .  Let  S(t)  be  the  semigroup  generated  in  H 
by  A,  with  domain  D(A)  =  {h  G  V;  Ah  G  H}.  Then 


(2  6)  {  Vv^  =  S^V°  +  Jo  S^~  ds  +  Jo  S('t  ~  dWs 

1  V*G  [0,T] 

(see  [2],  [13]).  Our  first  result  is  the  following: 


Theorem  2.1.  The  linear  manifold  Yt  =  {yv(T) ;  v  G  /2(0,T;  H) }  is  dense  in  the 
space  L2(Cl,  Tt\  H). 


Proof.  Using  (2.6),  it  suffices  to  check  that,  if  /  G  L2(U,.?t;77)  and 
(2.7)  E(£  S(T-s)(lov(s))ds,f)  =  0  Vu  G  /2(0,T;  77), 
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then  necessarily  /  =  0.  Let  /  be  a  function  in  L2(fl,  Tt\  77)  satisfying  (2.7)  and 
assume  that  ip  £  72(0,  T;  77 )  is  given  by 

(—dttp  —  Aip  =  0  in  Q, 
ip  =  0  on  E, 

<P(T)  =  L 

i.e.  ip(t)  =  S(T  —  t)f.  It  will  be  sufficient  to  prove  that  E[(p(t)\Pt]  =  0  for  all 
t  £  (0 ,  T).  Indeed,  this  and  the  continuity  property  (2.2)  of  the  family  {T1)}  clearly 
imply  /  =  E[tp(T)\fFt]  =  0.  We  know  that 

e[  (v(s),  10i^(s))  ds  =  0  Vv  £  f2(0,  T;  H). 

Jo 

Consequently,  le>-E[yJ(f)|.7:t]  is  a  stochastic  process  in  72(0,T;  H)  such  that 

E  [  ( v(s ),  1 0E[p{s)\Es])  ds  =  f  E(E[(v(s),  lc,V’(s))|^rs])  ds 
Jo  Jo 

< 

=  [T  E(v(s),  1  o<p(s))  ds  =  0  Vve  I2( 0,  T;  H) 

Jo 

and  one  has 

(2.8)  loE[<p(t)\Et]=0. 

For  each  t  €  (0 ,T),  E[ip(t)\Et)  =  S(T  —  t)E[f\Et]  is  real  analytic  in  the  variable 
x  G  D.  Hence,  one  must  necessarily  have  E[ip(t)\Jrt]  =  0  for  all  t  e  (0,T)  and  the 
theorem  is  proved. 

Remark  2.1.  We  deduce  from  theorem  2.1  that,  for  all  ya  6  L2(Q,  fT;  H),  e  >  0  and 
6  >  0,  a  control  v  €  72(0,  T;  77)  can  be  found  such  that 

P{\yv(T)  -  yd\  <  e}  >  1  -  6. 

The  existence  of  a  control  v  e  72(0,  T;  77)  such  that  P{\yv(T)  —  yd\  <  e}  =  1  is  an 
open  question. 

The  assertion  in  theorem  2.1  remains  true  for  systems  governed  by  more  general 
equations.  More  precisely,  one  has: 

Theorem  2.2.  Assume  that,  in  (2.5),  A  e  C{V\V')  is  an  operator  of  the  form 

Ay  =  diiaijdjy )  +  dfbjj)  +  cy, 

where  the  coefficients  satisfy 

aijeC\D),  h,  c  e  L°°(D) 

and  the  usual  ellipticity  condition 

aij(x)Xj\i  >  a|A|2  VA  €  M.N,  Vx  6  D,  a  >  0. 

Then  the  corresponding  linear  manifold  Yp  =  {yv(T) ;  v  £  72(0,  T;  77) }  is  dense  in 
the  space  L2(XI,Pt',  77). 


E.  Fernandez-Cara  and  J.  Real 


145 


Proof.  It  is  analogous  to  the  proof  of  theorem  2.1.  Thus,  let  us  denote  again  by  S(t) 
the  semigroup  generated  by  A  in  H.  Let  A*  and  S*(t )  stand  for  the  corresponding 
adjoint  operators.  Assume  that  /  G  L2(Ll,  H)  and  (2.7)  is  satisfied.  By  putting 
ip(t)  =  S*(T  —  t)f,  one  finds  again  (2.8). 

Unfortunately,  now  E[ip(t)\J-t]  is  not  in  general  real  analytic  in  x  for  0  <  t  <  T. 
So,  we  cannot  deduce  directly  from  (2.8)  that  E\ip{t)\Ti\  =  0  for  all  t  (this  would 
suffice).  However,  using  the  fact  that  the  equation  —  dt<p  —  A*(p  =  0  has  the  unique 
continuation  property,  this  can  be  arranged.  This  is  shown  in  the  following 

Proposition  2.1.  Let  the  assumptions  in  theorem  2.2  he  satisfied.  Assume  that  f  £ 
L2{Q,E;H)  andl0E[S*(T  -t)f\Ft]  =0for  allt  £  (0,T).  Then 

(2.9)  E[S*(T-t)f\Jrt]  =  0  a.e.inD  VfG(0,T). 

Proof.  Our  assumptions  on  A  imply  unique  continuation  for  all  functions  S*(T  —  t)h 
with  h  £  H  (cf.  [16]).  In  particular,  if  h  is  such  that  1  qS*(T  —  t)h  =  0  for  all 
t  £  [71,72],  then  S*(T  —  t)h  =  0  a.e.  in  D  for  all  t  £  [ly,  r2].  Let  us  fix  r  G  (0,  T)  and 
F  £  Tt  and  let  us  prove  that 

(2.10)  [  E[S*(T  —  T)f\FT}  dP  =  0. 

J  F 

Since  r  and  F  are  chosen  arbitrarily,  this  will  imply  (2.9).  We  observe  that,  for  each 
t£[0,T], 

(  S*(T-t)fdP  =  S*(T-t)(E(lFf)). 

Jf 

Also,  from  the  properties  of  conditional  expectation,  one  has 

[  S*(T  -  t)f  dP  =  f  E[S*{T-t)f\Ft]dP  VtG[r,T], 

J  F  J  F 

Hence,  E(lpf)  is  a  function  in  H  such  that 

loS*(T-t)(E(lFf))=  [  loE[S*(T-t)f\Ft}dP  =  0  Vte[r,T). 

JF 

Prom  the  unique  continuation  property,  (2.10)  is  obtained. 

Remark  2.2.  We  can  extend  to  the  stochastic  framework  the  penalization  methods 
in  [7].  More  precisely,  assume  that  yj,  is  given  in  the  space  L2{ U,  Tp\  H)  and  set 

Jk{v)  =  E  fT\v\2dt  +  kE\yv(T)  -  yd |2  Vv  £  72(0,T;  H) 

Jo 

for  each  k  >  1.  It  is  not  difficult  to  prove  the  existence  and  uniqueness  of  a  process 
v k  minimizing  Jk  in  /2(0,  T ;  H).  Let  us  set  yk  =  yik.  Then  one  has 

(2.11)  lim  E\yk(T)  -  yd |2  =  0. 

k—>  00 

Indeed,  if  e  >  0  is  given  and  E\yv(T)  —  yd\2  <  |,  the  following  holds: 

hE\yk(T) -yd\2  <  Jk(vk)  <  Jk(v)  <  +  E  JQ  \v?  dt  V/c>l. 

Prom  this,  (2.11)  follows  easily. 
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Remark  2.3.  If  B  is  not  random  and  satisfies 

BeC°([0,T]-,C(K-  H )) 

and  yd  =  0,  then  it  follows  from  the  results  in  [10]  that  vk  is  a  feedback  control.  To 
be  more  precise,  one  has 

=  l  oQk(f)yk$)i 

where  Qk  is  the  unique  solution  in  C°([0,!T];  £+(#))  to  the  Riccati  equation 

Qk(t)  =  kS(T  -  t)S(T  - 1)  -  jf  S{s  -  t)Qk{s)l0Qk{s)S{s  -  t )  ds 

(here,  C^{H)  denotes  the  set  formed  by  all  self-adjoint  nonnegative  operators  in 
C{H)).  Furthermore,  in  this  particular  case  the  optimal  cost  is  given  by 

Jk{ik )  =  {Qk(0)y0,yo)  +  Jq  trace  B*(t)Qk(t)B(t)W  dt, 

with  W  e  £(H)  being  the  covariance  operator  of  wt. 

Remark  2.4.  Let  us  finally  mention  that  the  duality  methods  in  [7]  also  work  in  this 
context.  More  specifically,  let  Z  stand  for  the  Hilbert  space  L2(Q,  Tt\  H )  (with  norm 
||  •  || z  and  scalar  product  (•,  • )z )■  Assume  e  >  0,  y0  €  H  and  yd  £  Z  are  given.  It  is 
then  natural  to  minimize 

(2.12)  \e  f[  \v\2dxdt 

2  J  JOx(0,T) 

over  the  (nonempty)  set 

(2.13)  /2(0,T;F)n{u;  \\yv(T)  -  yd\\z  <  e}. 

In  the  conditions  of  theorem  2.2,  there  exists  exactly  one  minimizer  (up  to  an  additive 
function  vanishing  on  O  x  (0,  T)).  This  is  given  as  follows.  Let  us  introduce  the  dual 
functional 

j  J(f)  =  \E  JJox(ot)  \<p\2dxdt  +  £\\f\\z-{ydJ)z 

\  V/  €  Z, 

where  yd  =  y<i~  S(T)y0.  Here,  we  have  used  the  notation  fp{t)  =  E[ip(t)\iFt},  with 
ip  e  /2(0,  T;  H)  being  given  by 

f  -dtp  -  A*p  =  0  in  Q, 

<  (p  =  0  on  S, 

l  <p(T)  =  f, 

It  can  be  proved  that  J  :  Z  R  is  strictly  convex,  continuous  and^  due  to  unique 
continuation,  coercive.  Consequently,  there  exists  one  and  only  one  /  G  Z  satisfying 

(2.14)  J(f)  <  J{f)  V/  G  Z. 

Let  /  be  the  solution  to  (2.14).  Then,  if  we  set  v(t)  =  E[(p{t) for  all  t,  it  is  not 
difficult  to  check  that  Iqv  is  the  control  process  minimizing  (2.12)  in  the  set  (2.13). 
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3.  A  null-controllability  result 

In  this  section,  we  present  a  null-controllability  result  for  (2.5).  Again,  this  is  the 
analog  of  a  deterministic  result. 

Let  us  fix  a  positive  function  7  £  C°°(0,T)  such  that  7 (t)  =  t  near  t  =  0  and 
7 (t)  =  T  —  t  near  t  =  T.  It  will  be  assumed  that  the  hypotheses  in  theorem  2.2 
hold,  that  B  is  not  random  and  satisfies  B  £  C'1([0,  T1];  £(K;  H ))  and,  also,  that  the 
support  of  B(t)  does  not  intersect  O  for  all  t. 

Theorem  3.1.  There  exists  a  positive  function  p  =  p[x)  such  that,  if 
(3.1)  JJ^  t  (7(t)_1  \\B\\2C(K.H)  +  7(t)3\\dtB\\2C(K; H)^J  <+00, 

then,  for  each  y0  £  H  there  exists  v  £  I2(0,  T;  H)  satisfying  yv(T )  =  0. 

Sketch  of  the  Proof.  We  will  adapt  the  arguments  in  [8]  in  the  context  of  (2.5).  We 
will  previously  rewrite  the  null-controllability  problem  as  an  equivalent  problem  for 
which  y0  =  0. 

(i)  Let  9  =  6{t )  be  a  C°°  function  such  that  9(t)  =  1  near  t  =  0,  6{t )  =  0  near 
t=T  and  0  <  6  <  1.  Let  us  introduce  the  function  £,  with 

£(f)  =  S(t)y0  Vf. 

Then,  by  setting  g  =  ~9'{t)£  and  zv  =  yv  -  0(t)£(t),  one  sees  that  yv{T)  =  0  if  and 
only  if  the  unique  solution  to 

{dtzv  -  Azv  =  Iqv  +  g  +  Bwt  in  Q  P  -  a.s., 
zv  =  0  on  E  P-  a.s., 
z„(0)  =0  in  D  P  -  a.s. 

satisfies  zv(T)  =  0. 

(ii)  Following  the  ideas  of  [8],  let  us  introduce  (and  solve)  an  auxiliary  variational 
problem. 

Let  O'  be  a  nonempty  open  set  satisfying  O'  CC  O.  We  will  put  D'  =  D\  O', 
Q'  =  D'  x  (0,  T)  and  S'  =  dD'  x  (0,  T).  As  usual,  0'v  will  stand  for  the  open  77- 
neighborhood  of  O'.  The  usual  co-normal  derivative  operator  associated  to  A  and 
A*  will  be  denoted  by  Oa-  We  will  need  the  following 

Proposition  3.1.  There  exist  a  positive  function  p  £  C2(D )  and  a  positive  constant 
C *  such  that 

II  7  ( \dtq\ 2  +  \D2q\2)  e~2?  +  JJ^  7_1|Vg|2e-2^  +  JJ^  7“3k|2e-2^ 

£C*(//0.|%'M‘,|V,;) 

for  all  functions  q  £  C2{Q')  such  that  q  =  0  on  S'  and  dAq  =  0  on  80'  x  (0,  T). 

This  Carleman  inequality  is  proved  in  [9]  (see  also  [6]  and  [17]  for  other  more 
general  estimates  of  the  same  kind).  In  the  sequel,  it  will  be  assumed  that  B  satisfies 
(3.1)  with  p  furnished  by  proposition  3.1.  We  will  prove  that,  for  some  v ,  one  has 
zv(T)  =  0. 
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Let  us  introduce  the  linear  space 

$0  =  {?e  C 2(Qi) ;  q  =  0  on  £',  dnq  =  0  on  dO'  x  (0, T)  }. 

Prom  proposition  3.1,  we  know  that 

[p>  q]  =  fjQSdtP  +  A* p }(dtq  +  A*ti  e”2^ 

is  a  scalar  product  in  T0.  Let  'I'  be  the  completion  of  'I;0  for  the  scalar  product  [•,■]. 
Then  (3.3)  is  satisfied  for  all  Let  us  put  P  =  L2(Cl,  T),  a  Hilbert  space  for 

the  scalar  product 

E[p,  q\=E  JJ  ( dtp  +  A*p)  (dtq  +  A*  q)  . 

Let  us  also  put 

(l,  q)  =  ~E  JJ  gq  +  E  JJ^  ( qdtB  +  dtqB)  wt  q  £  P. 

Then,  using  (2.1),  (3.1)  and  proposition  3.1,  it  can  be  shown  that  l  is  a  bounded 
linear  form  on  P.  Arguing  as  in  [8],  we  introduce  the  following  problem: 

(3.4)  E\p,  q}  =  (l,  q)  V q  £  P,  p  £  P. 

Obviously,  (3.4)  possesses  exactly  one  solution  p.  Let  us  put  z  —  e~2*(d tp  +  A*p). 
Among  other  things,  one  has  z  £  L2(fl,P\L2(Q')). 

(iii)  It  can  be  seen  that  z  has  sample  paths  in  CQ([Q,T]-,H~l(D’))  and  satisfies 

(z(t),  q0)  =  J*  {(*(«),  A*q0)  +  (g(s),  q0)}  ds  +  (jf  B(s)  dws,  q0) 

P- a.s.  for  all  t  £  (0,  T)  whenever  q0  is  (for  instance)  a  function  in  Cg°  (£>').  The 
stochastic  integral  arises  as  a  consequence  of  Ito’s  formula  (here,  the  fact  that  B  is 
not  random  is  needed).  It  is  thus  clear  that  z  £  J2(0,T;  L2(D')).  Let  6  be  such  that 
0  <  S  <  |  dist  (O',  dO)  and  let  x  G  C°°(D)  be  a  cut-off  function  satisfying 

0<X<1,  x  =  0  in  0's,  x=l  in  D  \  0'2S. 

Then,  z  =  xz  can  be  extended  by  zero  to  the  whole  domain  Q.  Its  extension, 
also  denoted  by  z,  satisfies  z  £  I2(0,T]H)  and  also  the  following  equalities  for  all 
q0  £  D(A*)  and  all  t  £  (0,  T): 

(z(t),  q0)  =  Jq  {(z(s),  A*q0)  +  ( g(s ),  g0)}  ds  +  (J^  B(s )  dws,  q0) 

ft  _ 

+  /  {(biz(s),  diq0 )  +  (cz(s)  +  (x  -  1)3(5),  go)}  ds. 

Jo 

Here,  b{  =  2atj  djx  and  c  =  Q?x  -  h  dot ■  From  known  results,  we  are  now  able  to 
ensure  that  z  £  /2(0,T;  V")  (for  example,  see  [2],  [13]). 

(iv)  Let  us  introduce  a  second  cut-off  function  x  G  C°°(D),  with 

0<x<  1,  x  =  0  in  0'2S,  X=1  ™D\0'3S. 
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By  putting  2  =  x  z,  we  see  that,  for  some  v  £  I2( 0,  T\H), 


(3.5) 


( z(t ),  q0)  =  J  {(^(s),  A*q0)  +  ( g{s ),  q0)}  ds  +  B(s)  dws,  q0) 
+  J*(lov(s),q0)ds  Vq0£D(A*),  Vte(0,T)  P  -  a.s. 


From  (3.4),  (3.5)  and  the  fact  that  2:  £  72(0,T;  V),  it  is  easy  to  deduce  that  z  =  zv 
(the  unique  solution  to  (3.2))  and  also  that  z(T)  =  0. 


4.  The  case  of  the  Stokes  and  quasi-Stokes  systems 

The  assertion  in  theorem  2.2  also  holds  in  the  case  of  a  quasi-Stokes  stochastic 
system  of  the  form  (1.1)  with  bounded  coefficients  a*  and  cy.  To  be  more  precise,  let 
us  introduce  the  space 

V  =  {ip  £  C£°(D)n  ;  V  •  y  =  0  in  D} 

and  let  us  denote  by  V  (resp.  H)  the  closure  of  V  in  Hq(D)n  (resp.  L2(D)N).  In  this 
section,  A  will  stand  for  the  operator  in  C(V ;  V')  given  by 

(Ay,  z)  =  -  jjyy  ■  Vz  +  al(x)yjdizj  +  cij(x)yizj}  dx  Vy,  z  £  V, 

where  it  is  assumed  that 

au  Cij  £  L°°(D). 

Regarded  as  an  unbounded  operator  on  H  with  domain 

D(A)  =  {y£V-,  Ay£H}, 

A  is  the  generator  of  a  semigroup  on  H,  again  denoted  by  S(t).  Assume  B  and  y0 
are  given  and  satisfy  (2.3)  and  (2.4).  For  each  v  £  72(0,  T;  L2(D)N),  there  exists  one 
and  only  one  solution  yv  to  the  problem 

r  yv  £  /2(0,T;  V)  H  L2(Q-  C°([0,T];  H)), 

\  Vv(t)  =Vo  +  Jo  {Ayv(s)  +  l0v(s)} ds  +  B(s)dws  Vfe[0,T]. 

In  fact,  if  PH  stands  for  the  orthogonal  projector  from  L2(D)N  onto  H  (the  Leray 
operator),  then  yv  is  given  by  the  following  identities: 

|  yv(t)  =  S(t)y0  +  jT  S(t-  s)  [Pff(l0n(s))]  ds  +  jT  S(t  -  s)B(s)  dws 
1  Vi  6  [0,T], 

Using  this  and  the  unique  continuation  property  established  in  [4] ,  we  can  argue  as 
in  the  proofs  of  theorem  2.2  and  proposition  2.1.  The  conclusion  is: 

Theorem  4.1.  With  the  notation  used  in  this  section,  the  linear  manifold  Yt  = 
{yv(T);  v  £  I2(0,T;  L?(D)N) }  is  dense  in  the  space  L2(Q,,Tt\ H). 
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Remark  4.1.  For  systems  governed  by  stochastic  quasi-Stokes  problems  of  this  kind, 
we  can  adapt  the  arguments  in  Remarks  2.2,  2.3  and  2.4.  Thus,  similar  penalization, 
feedback  and  duality  results  can  be  obtained. 

Remark  4.2.  If  (1.1)  is  the  stochastic  Stokes  problem  (i.e.  at  =  Cij  =  0),  it  is  possible 
to  prove  that,  for  fixed  1  <  j  <  N,  the  set 

{llv(T) ;  v  G  72(0,T;  L2(D)N),vj  =  0} 

is  also  dense  in  The  proof  is  as  in  the  similar  deterministic  case 

(cf.  [11]). 

Remark  4.3.  For  the  stochastic  3D  Stokes  problem  in  a  cylindrical  domain  D  = 
G  x  (0,  L),  one  also  has  approximate  controllability  in  a  “generic”  sense  with  respect 
to  G  with  controls  in  the  set 

Uad  =  {ve  I\ 0,  T;  L\Df) ;  Vl  =  v2  =  0}. 

More  precisely,  using  the  results  in  [12]  and  arguing  as  above,  it  can  be  seen  that, 
for  any  given  bounded  domain  G  C  R2  of  class  Ck  (with  k  >  3),  there  exists  another 
domain  G  arbitrarily  close  to  G  in  the  Ck  topology  such  that  the  corresponding  set 
{yv{T) ;  v  £  Uad }  is  dense  in  L2{Q,,Ft\H). 

To  our  knowledge,  whether  or  not  null-controllability  (i.e.  theorem  3.1)  holds  for 
systems  governed  by  (1.1)  is  an  open  question. 
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Abstract.  This  article  presents  a  reduced  basis  method1  for  constructing  a  reduced  order 
system  for  control  problems  governed  by  nonlinear  partial  differential  equations.  The  major 
advantage  of  the  reduced  basis  method  over  others  based  on  finite,  element,  finite  difference 
or  spectral  method  is  that  it  may  capture  the  essential  property  of  solutions  with  very  few 
basis  elements.  The  feasibility  of  this  method  is  demonstrated  for  boundary  control  prob¬ 
lems  modeled  by  the  incompressible  Navier-Stokes  and  related  equations  with  the  boundary 
temperature  control  and  boundary  electromagnetic  control  in  channel  flows. 

1991  Mathematics  Subject  Classification.  93B40,  49M05,  76D05,  49K20,.  65H10,  76W05, 
80A20 

Key  words  and  phrases.  Reduced  basis  method,  finite  elements,  optimal  control,  flow  con¬ 
trol,  boundary  temperature  control,  electromagnetic  control. 

1.  Introduction 

Real  time  simulations  of  control  problems  that  involve  partial  differential  equations 
as  state  equations  are  often  formidable  problems  to  solve.  Our  work  was  motivated  by 
the  recent  interest  in  optimal  flow  control  of  viscous  flows  which  are  control  problems 
involving  Navier-Stokes  equations  as  state  equations;  see  [11]  for  a  review.  These 
problems  are  by  far  the  most  challenging  control  problems  in  computational  engi¬ 
neering  and  science.  The  major  difficulty  is  mainly  due  to  the  nonlinearity  in  the 
state  equations  and  these  state  equations  when  discretized  can  number  in  millions. 
Thus  the  conventional  approaches  cannot  be  adequate  for  solving  such  large  scale 
control  problems 

In  this  article  we  discuss  a  reduction  type  method  which  may  provide  an  avenue 
to  overcome  this  difficulty.  In  this  method  hereafter  called  reduced  basis  method 
one  uses  basis  functions  which  are  closely  related  to  and  generated  from  the  problem 
that  is  being  solved.  This  is  in  contrast  to  the  traditional  numerical  methods  such 
as  finite  difference  method  and  finite  elements  method  which  uses  grid  functions  and 
piecewise  polynomials,  respectively,  as  basis  functions. 

There  are  several  approaches  available  for  the  selection  of  basis  functions  in  reduced 
basis  method.  One  such  approach  is  Taylor  approach  in  which  one  uses  solutions  at  a 
reference  point  in  the  parameter  space  along  with  their  derivatives  as  basis  functions. 

This  work  was  supported  by  the  Air  Force  Office  of  Scientific  Research  under  grants  AFOSR 
F49620-95- 1-0437  and  AFOSR  F49620-95-1-0447.  This  work  was  partially  supported  by  the  Office 
of  Naval  Research  Grant  N00014-96- 1-0265. 
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Another  approach  which  we  call  Lagrange  approach  uses  solutions  of  the  problem  at 
various  parameter  values  as  basis  functions.  Finally  the  Hermite  approach  is  a  hybrid 
of  Lagrange  and  Taylor  approaches  which  uses  solutions  and  their  first  derivatives 
of  the  problem  at  various  parameter  values  as  basis  functions.  The  applications 
of  reduced  basis  method  to  structural  mechanics  problems  can  be  found  in  [1]  and 

!6!  i7i- 

Our  goal  here  is  to  demonstrate  the  applicability  and  feasibility  of  reduced  ba¬ 
sis  method  for  control  problems  governed  by  Navier-Stokes  type  partial  differential 
equations.  We  will  consider  vorticity  minimization  problems  in  backward-facing  step 
type  channel  geometry  as  a  prototype  control  problem.  Two  fluid  flow  situations 
are  considered:  An  electrically  conducting  fluid  under  applied  magnetic  field  and  a 
thermally  convective  fluid.  In  the  first  situation  the  control  is  effected  by  boundary 
electric  potential  and  in  the  latter  the  control  is  boundary  temperature. 

Electromagnetic  Control.  When  the  fluid  is  electrically  conducting,  such  as  sea 
water,  one  can  obtain  an  interesting  control  mechanism  by  appropriately  placing 
electrodes  and  magnets  along  the  boundary  of  the  flow  domain  such  that  there  is 
a  coupling  between  magnetic  field  B  and  the  current  j,  see  Figure  1.1  for  one  such 
setup.  This  coupling  produces  a  forcing  j  x  B  which  appears  in  the  Navier-Stokes 
equations  and  is  known  as  Lorenz  force,  see  [10].  This  forcing  can  be  exploited  to 
control  fluid  flows.  In  §5,  we  will  descirbe  this  control  mechanism  and  in  §6  we 
will  use  this  for  a  vorticity  minimization  problem  in  fluid  flows  to  demonstrate  the 
feasibility  of  reduced  basis  method. 


Figure  1.1.  A  simple  diagram  of  MHD  setting 
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Boundary  Temperature  Control.  By  imposing  a  temperature  gradient  in  the  flow 
by  adjusting  boundary  temperature,  one  can  either  enhance  or  counteract  the  flow 
effects.  In  §5-6,  we  will  use  this  control  mechanism  for  a  vorticity  minimization 
problem  in  fluid  flows  to  demonstrate  the  feasibility  of  reduced  basis  method. 

A  Vorticity  Minimization  Problem.  As  a  prototype  problem  for  vorticity  minimiza¬ 
tion  in  fluid  flows,  let  us  consider  the  flow  through  backward-facing  step  channel  at 
Reynolds  number  200.  For  this  Reynolds  number  range  flows  can  be  assumed  to  be 
steady  and  two  dimensional.  Thus  we  consider  a  steady  plane  flow  in  a  backward¬ 
facing  step  channel  as  shown  in  Figure  6.1.  The  corresponding  flow  at  this  Reynolds 
number  is  shown  in  Figure  6.3  which  has  a  corner  circulation  near  the  corner.  Our 
objective  is  to  suppress  this  recirculation  by  using  the  control  mechanisms  discussed 
previously.  We  formulate  this  as  an  optimal  control  problem  with  a  cost  functional 
representing  vorticity  in  the  flow  which  is  thus  minimized  subject  to  the  governing 
equations  of  the  flow  under  consideration. 

We  will  consider  the  two  control  mechanisms  described  previously  in  two  different 
flow  setting.  That  is  flow  is  either  electrically  conducting  or  thermally  convective  and 
we  will  consider  electromagnetic  control  in  the  former  and  boundary  temperature 
control  in  the  latter. 

2.  Reduced  Basis  Spaces  and  Reduced-Order  Equation 

In  order  to  illustrate  the  reduced  basis  method,  we  assume  for  ease  in  exposition 
that  we  are  dealing  with  nonlinear  dynamics  about  the  equilibrium  points.  Consider 
the  the  parameterized  stationary  problem 

(2.1)  £(y,  m)  =  0  for  y  €  A,  y  €  X, 

where  y  represents  some  physical  parameter,  for  example,  Reynolds  number  or  vis¬ 
cosity  and  £  :  X  X  A  — >  X*  is  C2.  Equation  (2.1)  defines  a  solution  function 
fj,  &  A —>  y(fi)  G  X.  We  construct  the  reduced  basis  elements  by  the  interpolation  of 
solution  function  y  — >  y(y)  as  follows. 

The  Taylor  Subspace.  In  this  choice,  one  uses  the  Taylor  expansion  of  function 
y{y)  at  a  reference  value  of  y,  say  y*,  and  the  reduced  basis  subspace  Xr  is  defined 
as 

(2.2)  XR  =  spangly,-  =  ^\^,j  =  0, . . . ,  M} 

The  jth  derivative  yj  can  be  calculated  from  the  equations  resulted  by  successive 
differentiation  of  (1.1),  i.e. 

(2.3)  Sy(yo,  yo)yj  =  Xj{y0,  Vu-  -  - ,  2/j-i,  H*)- 
For  example,  yi  satisfies  the  equation 

£y{yo,v*)yi  =  -£,i(y<hV*)- 

We  note  here  that  each  yj  can  be  obtained  from  its  predecessors  by  solving  a  linear 
system  with  the  same  linear  operator  £v(y0,y*).  However,  one  cannot  continue  to 
use  the  same  basis  elements  generated  at  fixed  parameter  y*  to  compute  solutions 
when  the  parameter  of  interest  is  significantly  away  from  it.  In  such  cases  reduced 
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basis  elements  have  to ’be  updated  and  the  solution  is  sought  in  the  new  reduced  basis 
space.  Moreover,  generating  the  right  hand  side  of  (2.3)  could  be  quite  complicated 
in  certain  problems.  This  choice  has  been  used  in  [7]  for  structural  analysis  problems 
and  in  [8]  for  high  Reynolds  number  steady  state  fluid  flow  calculations. 

The  Lagrange  Sub-space.  In  this  case,  the  basis  elements  are  solutions  of  the  non¬ 
linear  problem  under  study  at  various  parameter  values  pr  The  reduced  subspace  is 
given  by 

(2.4)  XR  =  span  {i/ |i/  =  y(pj),j  =  1 

The  Lagrange  basis  was  used  to  study  structural  problems  in  [1].  A  possible  advantage 
in  this  choice  is  that  updating  the  basis  elements  can  be  done  one  basis  element  at  a 
time  instead  of  generating  the  whole  space. 

The  Hermite  Subspace.  This  is  a  hybrid  of  the  Lagrange  and  Taylor  approach.  The 
basis  elements  are  solutions  and  their  first  derivatives  at  various  parameter  values  p.j. 
The  reduced  subspace  is  given  by 

dy  — * 

(2.5)  XR  =  span{?/J  =  y(pj)  and  j  =  !,■■■,  M}. 

Suppose  we  have  :a  reduced  basis  space  XR  in  X.  Let  m  =  dim(XR)  and  {<^}  is 
a  basis  of  Xr.  Then  we  can  construct  the  reduced-order  equation  by  the  Galerkin 
approximation,  i.e.,  tfor  ym  =  YT=i  e  Xr 

(2.6)  p)i  =  M)>  4>i}X*'S.X 

for  1  <  i  <  m. 

For  the  evolution  equation 

(2.7)  ^y(t)  +  E(t,y(t))  =  0 

we,  for  example,  generate  reduced  basis  elements  in  X  by  the  solutions  at 

m  different  time  instants  to  (2.7).  Given  the  reduced  basis  space  Xr,  we  define  the 
reduced-order  equation  for  ym(t )  =  J2iLi<Xi{t)<Pi, 

(2.8)  ( -j^Vm (t)  +  E(t,ym(t)),(f>i}x-xx  =  0 

for  all  1  <  i  <  .to.  In  [4]  this  method  has  been  carried  out  and  its  feasibility  has 
been  demonstrated  for  channel  flow  simulations  in  which  reduced  order  solution  um 
is  formed  by  setting 

m 

um(i) 

2=1 

where  (j),  =  ui+1  — >Uj,  i  —  1, 2, . . .  ,m  -  1  and  <f>n  =  um.  We  further  take  arn  =  1  so 
that  the  boundary 'Conditions  are  satisfied.  The  solution  um  is  computed  from 

(f  um,vm)-+  A-a(um,vm)-+  c(um,um,vm)  =  (f,  vm)  for  all  vm  G  VJ*, 

where  V™  =  span{^  :  i  =  1, . . . ,  m  -  1}  is  the  span  of  the  test  functions. 

The  basis  elements  were  generated  by  computing  the  flow  from  the  full  model  at 
eleven  time  instances  between  1  and  11.  The  time  step  used  in  the  reduced  order 
model  was  .001. 
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The  dimension  of  the  reduced  basis  space  is  very  much  problem-dependent.  The 
reduced  basis  elements  constructed  by  the  above  mentioned  approach  can  be  nearly 
linearly  dependent.  So  we  may  further  reduce  the  dimension  by  the  conditioning  of 
the  mass  matrix  Q: 

Qi,j  {fn 

Our  computational  experiments  and  the  computations  reported  for  structural  prob¬ 
lems  in  the  references  mentioned  earlier  seem  to  indicate  that  an  accurate  approx¬ 
imation  can  be  obtained  for  large  range  of  parameter  values  using  5  to  10  basis 
elements.  Therefore,  although  the  resulting  reduced  order  model  is  dense,  they  are 
small  compared  to  the  sparse  but  large  system  that  result  from  the  standard  basis 
functions. 

According  to  our  comparison  study  carried  out  in  [4]  for  driven  cavity  incompress¬ 
ible  Navier-Stokes  calculations,  the  performance  of  Hermite  approach  is  better  than 
that  of  Lagrange.  The  basis  elements  for  the  Lagrange  approach  were  selected  at 
Reynolds  numbers  100,  300,  500  and  700,  and  that  for  the  Hermite  was  selected  at 
300  and  700.  The  comparison  was  carried  out  by  computing  the  driven  cavity  flow  at 
Reynolds  number  1200.  The  L2-norm  difference  between  the  full  mixed-finite  element 
solution  u f  and  the  reduced  basis  solution  using  these  two  approaches  are  as  follows: 
||u;-u/||2  =  0.0889  and  ||uft  —  u/||2  =  0.0766,  where  u;  is  the  solution  obtained  using 
Lagrange  approach  and  iq,  is  that  obtained  using  Hermite  approach. 

3.  Error  Analysis 

In  order  to  justify  the  reduced  basis  solution  ym  we  need  to  have  a  post  verification 
criterion.  In  general  we  formulate  it  as  an  error  analysis  as  follows.  Let  X  and  Y  be 
two  Banach  spaces  and  A  be  a  compact  set.  Given  a  C 2  mapping 

£  :  (y,y)  G  X  x  A  ->  £{y,y)  <E  Y, 

and  we  consider  the  equation 

(3.1)  £(y,iA  =  0- 

The  family  {(y(y),y)  :  y  G  A}  is  said  to  be  a  branch  of  nonsingular  solutions  of 
equation  (3.1),  i.e. , 

y  — >  y(n)  is  a  continuous  function  from  A  into  X  and  Dv£(y1  y)  is 
an  isomorphism  from  X  onto  Y  for  all  y  G  A. 

Let  us  consider  the  reduced  order  problem 

(3.2)  £m(yrn,y)=  0. 

defined  on  the  reduced  basis  space  Xm.  We  assume  that  :  Xm  x  A  Ym  is  C2. 
For  the  ease  of  our  discussions  we  assume  that  Xm  C  X  and  Ym  CY.  The  norms  on 
Xm  and  Ym  are  induced  from  X  and  Y  norms,  respectively.  The  problem  is  to  find 
the  solution  ym  G  Xm  such  that  (3.2)  is  satisfied  for  a  given  /i  G  A. 
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We  assume  that  Dy£m(ym,p)  is  an  isomorphism  from  Xm  onto  Ym  where  ym  is  a 
given  element  in  Xm.  We  introduce  the  following  notations; 

^m(A0  =  ||£m(2/m;  h)  i[ym  > 

fmi.P)  —  \\Dy£m(jjmi  A4)  UcCKmAn)) 

Sm(u\  a)  =  {vGXm:  ||u  -  v\\Xm  <  a}, 

Lm(p;a)  =  sup  \\Dy£m(ym,  p)  -  Dy£m(v,  p)\\c(xm,Ym)- 

v£S(ym,a) 

We  next  state  a  theorem  regarding  the  error  estimate  which  is  derived  from  Theorem 
IV.3.1  in  [2]  for  the  approximation  of  branches  of  nonsingular  solutions. 

Theorem  3.1.  Suppose  Dy£m(ym,  p)  is  an  isomorphism  of  Xm  onto  Ym  and 

Then  the  problem  (3.2)  has  a  unique  solution  (ym{p),p)  such  that: 

ym{p]  €  S(ym,  2r)m{p)em{(i)) . 

In  addition,  ym(y)  is  the  only  solution  of  (3.2)  in  the  ball  Sm(ym;a)  for  all  a  > 
2^m(p)em(iJ,)  that  satisfy  /ym(iJ.)Lm(p]  a)  <  1  and  we  have  the  estimate: 

\\ym(y)-Vm\\x  <  hm(T)/(l-Tm{y)Lm{p\a)]\\£m{vm,p,)\\Y  for  all  vm  6  Sm{ym,a). 


Moreover,  we  have  the  following  corollary. 

Corollary  3.2.  Suppose  there  exists  an  element  ym  €  Xm  such  that  Dy£(ym,p)  is  an 
isomorphism  of  X  onto  Y  and 

(3.3)  27 (p)L(p,  2'y{p)e{p))  <  1 

where 

e(A0  =  \\£(ym,v)\\Y, 

T(m)  =  II Dy£{ymi  y)  IU(K,X), 

S{y,  a)  =  {v  e  X  :  ||w  -  v\\x  <  a}, 

L(p;a)  =  sup vES(ym,a)  \\Dy£(ym,T)  ~  Dy£(v, p)\\C(x,Y)- 
Then  the  problem  (3.1)  has  a  solution  (y(fi),p)  such  that: 

y(h)  e  S{ym]2'y{p)e{p)). 

In  addition,  y(p)  is  the  only  solution  of  (3.1)  in  the  ball  S(ym\a)  for  all  a  > 
2ry{p)e{p)  that  satisfy  7 (p)L(p',  a)  <  1  and  we  have  the  estimate: 

Mt)-v\\x  <  \^{p)/{l-^{p)L{p-a)]\\£{v,p)\\Y 

for  all  v  £  S(ym ,  a).  ■ 
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We  can  apply  Theorem  3.1  and  Corollary  3.2  to  obtain  the  following  error  estimate. 

Theorem  3.3.  (i)  Suppose  ym( p)  G  Xm  is  a  solution  to  (3.2)  and  assume  ym  = 
ym(p)  satisfies  the  condition  in  Corollary  3.2.  Then  we  have  a  solution  y(y)  G 
S(ym]2'-j(p)e(iJ,))  to  (3.1)  and  the  estimate 

(3.4)  \\y(n)  -ym(lJ-)\\x  <  [7(^)/(l  -7(M)i(M;a)]||f(ym(M),M)l|y 

with  a  =  7 (n)e(fj,). 

(ii)  Suppose  there  exits  an  element  ym  G  Xm  such  that  the  conditions  in  Theorem 
3.1  and  Corollary  3.2  are  satisfied.  Furthermore,  we  assume  that  am  =  2'ym(/i)em(iJ,) 
satisfies  j(fi)L(n;  am )  <  1.  Then  we  have  (3 A)  with  a  =  an.  ■ 

4.  Optimal  Control  Problems 

In  this  section  we  discuss  the  optimal  control  problem  and  the  application  of  re¬ 
duced  basis  method.  Consider  the  minimization  problem 

(4.1)  min  J(y,u)  subject  to  £(y,u)  =  0 

over  u  G  Uad  C  U.  Here  X  and  U  are  Hilbert  spaces  and  £  :  X  x  U  — >  X*  is  C2.  We 
assume  that  U  =  Rm  and  uad  is  closed  and  convex.  The  Lagrange  reduced  space  can 
be  defined  by 

Xr  =  span{y  G  X\£(yj,uj)  =  0,  j  =  1  ,...,M}, 

where  is  a  sampled  point  in  Uad-  In  order  to  obtain  a  lower-order  reduced  basis 
space,  if  m  is  large  then  we  may  consider  the  following  pre-selecting  step: 

•  Let  ua,  a  €  A,  be  the  points  in  Uad  defined  by 

ua  =  u  +  J2  e» 

where  a  is  the  integer-valued  vector,  and  6i  and  e*  are  the  step  size  and  unit 
vector  in  the  2-th  direction. 

•  We  determine  ya  G  X  by  solving  £(y ,  ua)  for  each  a  G  A. 

•  We  find  an  index  ao  in  A  such  that  J(ya,ua)  is  smallest. 

•  Then,  we  select  the  sampling  set  Uj  by 

u1  -  ua°,  u2t  =  uao  +  6iei,  and  u2l+1  =  uao  —  6ie{. 

The  Hermite  reduced  space  can  be  defined  by 

dv 

Xr  =  spanly-7  G  X  x  U\£(yJ,uJ)  =  0  and  — — ( u: ),  1  <  i  <  m,  j  =  1, . . .  ,M}. 

OUi 

Here,  can  be  calculated  by  solving  the  sensitivity  equation 

(4.2)  £y(y3  ,u3)ff  =  -£Ui  ( y0 ,  u3). 

Here  we  can  use  the  pre-selecting  step  to  select  u3  as  for  the  Lagrange  case. 

Suppose  we  have  the  reduced  basis  space  Xr.  Then  we  use  the  Galerkin  method 
to  project  the  equation  onto  Xr,  i.e.,  ym  =  YffLi  aifi  G  XR  satisfies 

£m{ym,u)i  =  {£{y,u),4>i)x>xx. 


160 


A  Reduced  Basis  Method  for  Control  Problems  Governed  by  PDEs 


Then  we  consider  the  reduced-order  control  problem 

(4.3)  min  J(ym,u)  subject  to  £m(ym,u)  =  0  and  u  £  Uad. 

It  is  a  finite  dimensional  constrained  minimization  problem  and  can  be  readily  solved 
by  the  constrained  optimization  methods.  A  necessary  optimality  condition  is  given 


by 

(  (DySmiUm 

(4.4) 

( Du^’mi^ym 

for  all  u  £  Uad,  assuming  Dy£m  at  the  optimal  pair  ( ym,um )  to  (4.3)  is  an  isomor¬ 
phism.  Similarly,  we  have  the  necessary  optimality  condition  for  (4.1):  there  exists  a 
Lagrange  multiplier  A  £  X  such  that 

[  (Dy£(y*,u*))*\  +  DyJ(y*,u*)  =  0 


{  (Du£(y*,u*)(u- u*),\)x.xX  +  Du J(y*, u*)(u- u*)>0 

for  all  u  £  Uad,  assuming  Dy£m£(y* ,u*)  at  the  optimal  pair  ( y*,u *)  to  (4.1)  is  an 
isomorphism.  Suppose  um  and  u *  is  interior  points  of  Uad.  Then  we  can  apply 
Theorem  3.3  to  equation  for  (y,  A,  u)  £  X  x  X  x  U 

[  £(y,u)  =  o 

(4.6)  {  (Dy£(y,u))*X  + DyJ{y,u)  =  0 

[  (Du£(y,u))*X  + DuJ(y,u)  =  0 

In  general  we  have 

J(ym,um)  <  J(ym(u),u)  =  J(ym(u),u )  -  J{y(u),u)  +  J{y{u),u) 
for  u  £  Uad  and  thus  setting  u  =  u*,  we  obtain 

J{y(um),Um )  -  J*  <  j{y{um),um)  -  J(ym, um)  +  J{ym{u*),u*)  -  J(y(u*),u*). 
Hence,  if  Uad  is  compact  then 

J{y{um),um)  -J*<2M  max  \\ym{u)  -  y{u)\\x 

u*^uad 

for  some  constant  M. 

5.  The  Reduced  Basis  Method  for  Flow  Control 

In  this  section  we  discuss  vorticity  minimization  in  fluid  flows  using  boundary  tem¬ 
perature  control  and  electromagnetic  control.  We  first  present  the  weak  variational 
formulation  of  the  optimal  control  problems  and  then  discuss  their  approximations 
in  finite  element  and  reduced  basis  setting. 

Preliminaries.  We  denote  by  L2(0)  the  collection  of  Lebesgue  square-integrable 
functions  defined  on  f2.  Let  Lf1(f2)  =  £  L2(Q)  :  £  L2(£l)  for  i  =  1,2.}  and 

Hq(£1)  =  (n  £  :  w|r  =  0}.  Vector-valued  counterparts  of  these  spaces  are 

denoted  by  bold-face  symbols,  for  e.g.,  H(fl)l  =  [if1(fl)]c!  where  d  =  2.  We  denote 
the  norms  and  inner  products  for  HS(Q)  or  H(fl)s  by  ||  •  ||s  and  (•,  •).,,  respectively. 
The  L2(Q)  or  L2(fi)  inner  product  is  denoted  by  (-,  •). 
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5.1.  Electrically  Conducting  Flow  Equations  and  Variational  Formulation.  In  this 
section  we  describe  the  governing  equations  for  a  steady  electrically  conducting  flow 
and  their  variational  formulation.  Suppose  there  is  a  length  scale  £,  a  velocity  scale 
U  and  a  magnetic-field  scale  B0  in  the  flow,  then  one  can  define  nondimensional 
magnetic  Reynolds  number  R^  =  p0aU£,  where  p0  is  the  magnetic  permiability, 
Alfven  number  Al  =  Bl/p<ipU‘1  and  Reynolds  number  Re  =  p0U£/p.  Next,  if  we 
nondimensionalize  according  to  x  <—  x/f,  u  <—  u/U,  j  <—  Bq/£,  E  <—  UBq  and  p  <— 
(p  —  g  •  x)/(p0U2),  we  obtain  the  dimensionless  equations  for  electrically  conducting 
flow: 

u  •  Vu  =  -Vp  +  Al  (j  x  B)  +  A-  Au  and  V  •  u  =  0  in  Q , 

j  =  Rm  [- V(j)  +  (u  x  B)]  and  V  ■  j  =  0  in  fi, 


V  x  B  =  j  and  V  •  B  =  0  in  Q . 

Here,  u  denotes  the  velocity  field,  p  the  pressure  field,  j  the  electric  current  density, 
B  the  magnetic  field  and  (j)  the  electric  potential.  Moreover,  we  denote  by  Q  the  flow 
domain  which  is  bounded  in  1R2  with  boundary  T. 

To  simplify  the  exposition,  let  us  assume  that  we  are  dealing  with  a  special  case 
in  which  the  externally  applied  magnetic  field  is  undisturbed  by  the  flow.  That  is, 
we  assume  that  B  is  given.  Such  an  assumption  can  be  met  in  a  variety  of  physical 
applications,  for  example  in  the  modeling  of  electromagnetic  pumps  and  the  flow 
of  liquid  lithium  for  fusion  reactor  cooling  blankets,  see  [10]  and  [12].  Under  this 
assumption,  the  term  j  x  B  in  the  Navier-Stokes  equations  can  be  written  as 

Al( j  xB)  =  lV  (-V</>  +  u  x  B)  x  B 

where  N  =  Al  •  Rm  and  if  we  eliminate  j  by  applying  charge  conservation  condition 
V-j  =  0toj=Am  [-V0  +  (u  x  B)],  we  arrive  at  the  following  simplified  system 
modeling  the  flow: 

(  —  d-  Au  +  u  •  Vu  +  Vp  +  N  (V^>  —  u  x  B)  x  B  =  0  infi, 


(5.1) 


{  V  •  u  =  0  in  Q , 


[  —A(j)  +  V  •  (u  x  B)  =  0  in  0 . 

where  N  is  the  interaction  parameter.  The  system  (5.1)  is  supplemented  with  bound¬ 
ary  conditions 

(5.2)  u  =  u0  on  T,  (j)  =  g  on  T0  and  n  •  j  =  0  on  Ti 

where  T  is  the  disjoint  union  T  =  r0urj  and  g  denotes  the  control  function,  namely, 
electric  potential  on  T0.  Such  a  control  can  be  effected  by  attaching  electric  sources 
with  adjustable  resistors  to  the  electrode  along  the  boundary.  We  assume  that  the 
flow  is  two-dimensional  and  the  applied  magnetic  field  B  is  perpendicular  to  the 
flow  plane,  i.e.,  B  =  (0,0,  Bo)*,  and  that  the  cross  product  u  x  B  is  understood  as 
(ui,u2, 0)*  x  (0, 0,  B0)*-  Let  u  G  HJ(fi)  and  cj>  €  B1  (Q)  be  such  that 


u  =  u0  on  T  and  <p  =  g  on  To 
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and  Vi  =  {4>  G  H 1(0)  :  <f>  =  0  on  r0}.  Then,  variational  formulation  of  (5.1)— (5.2)  is 
given  as  follows:  seek  u  G  Hg(fi)  +  u,  p  €  L2(fl)  and  <f>  G  Hq(Q)  +  <f>  such  that 

i(Vu,  Vv)  +  b0(u,  u,  v)  +  N  (V(/>  -  u  x  B,  v  x  B) 

-(p,  V  •  v)  =  0  Vv  e  H l(Q), 

(5.3) 

(V  •  u,  g)  =  0  Vg  G  L2(Q), 

CVef)  —  u  x  B,  \7ip)  =0  Vip  £  V\. 

Here,  the  trilinear  form  60(-,  •,  •)  is  defined  by 

(5.4)  60(u,v,  w)  =  (u  •  Vv,  w) 
for  u,  v,  w  G  H1(fl). 

5.2.  Thermally  Convective  Flow  Equations  and  Variational  Formulation.  In  this 
section  we  describe  the  governing  equations  for  a  steady  thermally  conducting  flow 
and  their  variational  formulation. 

If  we  assume  there  is  a  length  scale  t,  a  velocity  scale  U  and  a  temperature 
scale  Ti  -  T0  in  the  flow,  then  one  can  define  nondimensional  Prandtl  number 
Pr  =  ftCp/K,  Grashof  number  Gr  =  f3£3pl\g\(Ti  -T0)/g?  and  Reynolds  number 
Re  =  poUt/ p.  Next,  if  we  nondimensionalize  according  to  x  <—  x/£,  u  4—  u/U, 
T  <—  (T  -  T0)/(Ti  -  T0),  and  p  <—  (p  -  g  •  x)/(p0U2),  we  obtain  the  following  nondi¬ 
mensional  form  of  Boussinesq  equations: 

-•TAu  +  (u  •  V)u  +  Vp  +  |prTg  =  0  in  Q , 

(5.5)  •  V  •  u  =  0  in  , 

.  ~~R«Pr  ^  u  ’  =  ®  infl, 

where  g  is  a  unit  vector  in  the  direction  of  gravitational  acceleration. 

We  consider  the  boundary  condition  as  follows. 


u  =  u0  on  T,  T  =  p  onFo  and 


0  on  Fi 


where  g  represents  the  boundary  temperature  control  function.  Let  T  G  H1^!)  be 
such  that  T  —  g  onR  and  u  be  as  defined  previously.  Then,  variational  formulation 
of  (5.5)-(5.6)  is  given  by 

'  T-(Vu,Vv)  +  60(u,u,v)  +  (/3Tg,v)-(p,V-v)  =0  Vv  G  Hj(ft), 

(5.7)  (V  •  u,  g)  =  0  Vg  G  L2(£l), 

(u,  T,  4>)  +  k  ( VT,  VV0  W>  £  V\, 

for  u  G  Hj(Q)  +u,pG  L2(Q)  and  T  £  H£(Q)  +  f,  where  /3  =  k  =  and  the 
trilinear  form  &i (-,  -  ,  •)  is  given  by 

6i(u,T,-0)  =  (u-VT,^). 

for  T,  if;  £  H\tt)  and  u  G  H^fi). 
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We  can  establish  the  existence  of  solutions  to  (5.3)  and  (5.7)  using  the  above 
properties  of  the  trilinear  forms  and  the  Hopf’s  lemma,  see  [5]. 

5.3.  Mixed  Finite  Element  Approximation.  In  order  to  construct  the  reduced  basis 
element  we  use  the  mixed  finite  element  method  [3]  to  approximate  solution  to  (5.1)- 
(5.2)  and  (5.5)-(5.6). 

Let  us  define,  using  standard  finite  element  notations, 

Xh  =  {v  G  C°(Q)  :  v\K  G  Pi(K),  on  each  element  K}  , 

Xh  =  {v  =  («i  ,v2)t  e  C0(H)  :  Vi  G  Xh,  i  =  1,2} 

and 

Sh  =  {q  €  C'°(0)  :  q\x  G  P\(K),  on  each  element  K}. 

Also  we  define  Xq  =  {v  G  Xh  :  v  =  0  on  T}  and  =  Xh  fl  V\.  That  is,  we  choose 
continuous  piecewise  quadratic  polynomials  for  both  components  of  the  velocity  uh 
and  electric  potential  <j>h  for  (5.1)  and  temperature  Th  for  (5.5),  continuous  piecewise 
linear  polynomials  for  the  pressure  ph .  On  each  triangle,  the  degrees  of  freedom 
for  quadratic  elements  are  the  function  values  at  the  vertices  and  midpoints  of  each 
edge;  the  degrees  of  freedom  for  linear  elements  are  the  function  values  at  the  vertices. 
Here,  the  spaces  are  defined  over  the  same  triangulation  of  the  domain  Q  =  (J  AT. 
This  selection  is  known  to  satisfy  the  inf-sup  condition,  see  [2]. 

The  finite  element  equation  of  (5.3)  for  uh  G  Xh,  ph  G  Sh  and  <f>h  G  Xh  is  given  by 

jk(Vu\Vv*)  + Vufc,uk,v*) 

+N  (y<f>h  -u'-xB/xB)- (/,  V  •  \h)  =  0  Vvh  G  Xg, 

(V  •  uft,  qh)  =0  \/qh  G  Sh , 

(V0ft  -u*xB,  Xiph)  =  0,  Viph  G  Xf. 

where  u^lr  =  uj-  and  <f>h |r0  =  gh  and  u§,  gh  are  the  projection  of  u0,  g  onto  the  finite 
element  spaces,  respectively.  Similarly,  for  the  Boussinesq  equation  (5.7)  we  have 

'  ^(VUh,  Vvh)+bo(uh,uh,vh)+(/3Th  g,vh)  —  (ph,  V  •  v'l)  =  0Vv/iGX(j-, 

{5.7)h  (V  •  uh,  qh)  =  0  \/qhGSh, 

.  h{ uh,  Th,  i>h)  +  k  ( VTh ,  Xiph)  Viph  G  Xf , 
where  G  Xh,  Th  G  Xh  satisfy  u'l|r  =  and  Th\Vo  =  gh,  respectively. 

5.4.  Boundary  Control  Problems  and  Reduced-Order  Control  Problems.  Let  U  be 

the  control  space  defined  by 

p 

U  =  {g  =  J2  9iXi,  Qi&R) 

i= 1 

where  Xi  is  the  i-th  basis  function  of  U  and  is  the  trace  of  a  H1(f2)  function  onto  r0. 
We  consider  the  minimization  of  the  form 

(5.8)  min  ||V  x  u||q  subject  to  (5.3)  or  (5.7) 
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where  the  vorticity  V  x  u  is  defined  by  V  x  u  =  and  the  cost  functional 

defines  the  total  friction  forces  in  fi. 

We  define  the  reduced  basis  element  by  the  finite  element  approximation  (5.3)'1 
and  (5.7)'1,  respectively  for  each  control  problem.  For  example,  the  Lagrange  re¬ 
duced  basis  element  (u ,p,T)  given  gj  £  U  for  problem  (5.8)  subject  to  (5.7)  can  be 
constructed  by  a  solution  ( uh,ph,Th )  to  (5.7)fe  given  gi  £  U.  For  the  case  of  the 
boundary  control  problem,  the  reduced  basis  space  Xr  C  Xh  x  Xh  should  consist 
of  the  basis  element  <Fg  that  corresponds  to  the  reference  control  g  £  U,  the  ele¬ 
ment  <Fg  that  corresponds  to  the  j-th  control  in  the  direction  of  Xj >  and  the  test 
functions  £  XRn  (Xj)  x  Xg).  Since  uh  satisfies  the  pseudo-divergence  condition 
(V  •  uft,  qh)  =  0  for  all  qh  £  Sh  we  have  the  reduced-order  control  problem; 


(5.9) 


min  [|  V  x  uft||^  subject  to 
i(Vu\  Vvh)  +  bo( uh,  uh,  vh)  +  ( (3Th  g,  vh ) 
+b1(uh,Th,'iph)  +  K(XTh,Xtph)  =  0 


for  all  'F'1  =  ( vh,iph )  £  XR  n  (Xg  x  Xg).  Here,  the  element  in  XR  is  represented  by 


(uh,Th) 


V 


3= 1 


(gj  ~  9j)  a ft 


where  <E>g  =  (uh,  Th) 0  is  a  solution  to  (5.7)fe  corresponding  to  the  reference  control, 
<E>)  =  (uh,  Th)j  -  (uh,Th) o  for  1  <  j  <  p,  with  ( uh,Th)j  being  a  solution  to  (5.7)h 
corresponding  to  g  +  6j  Xj  and  {^'g}  are  a  basis  of  the  test  function  space  Xr  Pi  (Xg  x 


problem; 


(5.10) 


min  ||  V  x  uh||n  subject  to 

•  d_(Vuft,  Vv'1)  +  b0(uh,  uh,  vh)  +  N  (V<f>h  -  uh  x  B,  x  B) 
+(V<ph  -uh  xB,  Xtph)  =  0, 


for  all  ’F'1  =  (vft, iph)  £XRD  (Xg  x  Xg). 


6.  Computational  Results 

In  this  section  we  will  give  computational  result  by  implementing  the  computa¬ 
tional  procedure  for  the  specific  control  problem  described  in  §1  using  the  two  pro¬ 
posed  control  mechanisms.  We  select  the  backward  facing  step  channel  for  our  study, 
a  schematic  of  this  geometry  and  the  finite  element  grid  are  given  in  Figure  6.1  and 
Figure  6.2,  respectively.  It  has  been  observed  in  a  number  of  computational  and 
experimental  study  on  this  specific  channel  geometry  that  a  recirculation  appears 
near  the  corner  region  for  large  Reynolds  number.  Our  aim  is  to  remove/suppress 
the  recirculation  by  means  of  boundary  control. 
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Figure  6.1.  Schematic  of  backward-facing  step  channel 


r,0p 


Figure  6.2.  Finite  element  grid  for  the  channel  geometry 


6.1.  Boundary  Temperature  Control.  The  aim  is  to  shape  the  flow  to  a  desired 
configuration  which  in  our  study  means  to  remove  the  recirculation  by  means  of 
controlling  the  temperature  along  the  bottom  part  of  the  boundary. 

We  assume  that  the  inflow  and  outflow  are  parabolic,  i.e.  we  take  the  inflow  to 
be  Uj  =  8 (y  —  0.5) (1  —  y)  and  the  outflow  to  be  u  =  u„  =  y(  1  —  y).  We  take 
the  Reynolds  number  to  be  200  and  to  be  1.  For  the  temperature  we  used  the 
following  boundary  conditions: 

TSl  and  rout  :  =  0 

r*2  and  •  T  —  g 

Fin  and  Ftop  :  T  —  l. 

Figure  6.3  qualitatively  demonstrate  the  flow  for  high  Reynolds  number.  Here  our 
objective  is  to  remove  the  recirculation  that  occurs  in  the  corner  region  Q*.  Therefore 
we  minimize  vorticity  in  the  corner  region  £2*  which  is  chosen  to  be  £2*  =  (1, 3)  x  (0,  .5). 
This  leads  us  to  a  constrained  minimization  problem  of  the  type  (5.8)  and  this  is 
solved  by  the  reduced  basis  computational  method  described  in  §5.4. 

Basis  elements  are  computed  with  g=l,  0.875,  0.75,  0.625,  0.5,  0.3775,  0.25  and 
denoted  by  (ui;  7j),  i  =  1, . . . ,  7.  The  test  functions  {Tj, . . . ,  T5}  are  chosen  so  that 
they  have  zero  boundary  conditions.  The  trial  function  4>i  corresponds  to  the  control 
force  such  that  dq  =  0  everywhere  on  the  boundary  except  on  the  bottom. 

Then  we  set 

(u,  T)  =  $0  + 

6  i=i 
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Figure  6.3.  The  velocity  field  for  the  uncontrolled  case  with  Re— 200 


Figure  6.4.  The  velocity  field  for  the  controlled  case  with  Re— 200: 
Temperature  control 


where  g  =  1,  <5  =  —0.75  and 

'  $o  =  (ui,Ti),  =  (u7,T7)  -  (ui,Ti), 

4>!  =  (u7,T7)  -  2(u6,T6)  +  (u5,T5),  4>2  =  (u6,T6)  -  2(u5,T5)  +  (u4,T4), 

=  (u5,T8)  -  2(u4,T4)  +  (u3,T3),  4»4  =  (u4,T4)  -  2(u3,T3)  +  (u2,T2), 

.  ^5  =  (u3,  T3)  -  2(u2,  T2)  +  (u4,  Ti). 

The  constrained  minimization  problem  is  solved  by  employing  Newtons  method 
to  the  necessary  optimality  condition  (4.4).  We  obtained  the  boundary  temperature 
control  T  =  0.516  in  7  Newton  iterations.  The  computed  control  was  then  used  in 
the  full  system  to  simulate  the  flow.  The  resulting  flow  shown  in  Figure  6.4  shows 
significant  reduction  in  the  size  of  the  recirculation  region. 

6.2.  Electromagnetic  Control.  In  this  problem,  control  is  effected  through  bound¬ 
ary  electric  potential  on  the  top  and  bottom  boundary  of  the  backward  facing  step 

channel.  A  magnetic  field  B  =  (0,0, 1)  is  applied  into  the  fluid.  The  boundary  con¬ 

ditions  for  the  velocity  are  the  same  as  in  the  preceding  control  problem  except  for 
the  electric  potential  whose  boundary  conditions  are  as  follows: 


Tjn  and  rou^  : 

^  —  0 
dn 

^top  " 

<P  =  9i 

^bottom  : 

<t>  =  92 

rS2: 

$= 1 • 

We  take  the  Reynolds  number  to  be  Re  =  200  and  the  interaction  parameter  to  be 
N  =  1.  Here  also  our  objective  is  to  suppress  the  recirculation  that  occurs  in  the  cor¬ 
ner  region  Cl*  and  thereby  obtain  a  relatively  smoother  flow.  Therefore  we  minimize 
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Figure  6.5.  The  velocity  field  for  the  controlled  case  with  Re=200: 
Electromagnetic  control 


vorticity  in  the  corner  region  Q*.  This  leads  us  to  a  constrained  minimization  prob¬ 
lem  of  the  type  (5.8)  and  we  use  the  reduced  basis  computational  method  described 
in  §5.4  to  solve  it. 

The  basis  elements  were  computed  with 


(9u  flaMl.l),  (1,0.5),  (1,0),  (0.5,1),  (0.5, 0.5),  (0.5,0),  (0,1),  (0,0.5) 


and  the  corresponding  elements  are  denoted  by  (u,,  </>»),  i  =  1, . . . ,  8.  The  test  func¬ 
tions  {’J'j, . . . ,  4/5}  are  chosen  so  that  they  have  zero  boundary  conditions.  The  trial 
functions  $1  and  4>2  corresponds  to  the  control  force  such  that  $1  =  0  everywhere 
on  the  boundary  except  on  the  top  and  4>2  =  0  everywhere  on  the  boundary  except 
on  the  bottom.  Then  we  set 


(u,  </>)  =  $0  +  -1  $1  + 

£>l 


(92  ~  92) 
62 


where  =  §2  =  1,  <$1  =  <52  =  —0.5  and 

<f>o  =  (ui,<?!>i),  4>i  =  (113,^3),  4>2  =  (u7)(/>7), 

^I  =  (ui,^l)-2(U2,^2)-2(U4,^4),  ^2  =  (Ui,<M-(u3,03)-(u7,07), 

^3  =  (ui,0i)  -  (u5>5)  -  (u2,02)  -  (U4,04),  ^4  =  (ui,0i)  -  (u 8,0s)  -  (u 2,  fa), 
.  W5  =  (ui,^1)-(u6,^6)-(u4,04). 


We  employed  the  Newtons  method  to  the  necessary  optimality  condition  (4.4)  and 
obtained  the  boundary  controls  0top  =  1-0423  and  </>top  =  1-7735  respectively,  in  5 
Newton  iterations.  The  computed  control  was  then  used  in  the  full  system  to  simulate 
the  flow.  The  resulting  flow  shown  in  Figure  6.5  shows  significant  reduction  in  the 
size  of  the  recirculation  region. 

In  conclusion,  we  have  demonstrated  the  feasibility  of  using  reduced  basis  method 
in  both  one  parameter  and  two  parameter  control  setting.  Two  different  control 
mechanisms  have  been  used  in  two  different  fluid  flow  setting.  Our  numerical  results 
seem  to  indicate  that  the  reduced  basis  method  can  be  successfully  used  in  flow  control 
problems  with  significant  reduction  in  computational  cost  compare  to  the  results 
presented  in  [3],  [5]  and  [9]  for  the  same  problems  where  computations  were  performed 
by  directly  applying  finite  element  methods  to  the  optimal  control  problems. 
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Abstract.  The  application  of  a  proximal  point  approach  to  ill-posed  convex  control  prob¬ 
lems  governed  by  linear  parabolic  equations  is  studied.  A  stable  penalty  method  is  con¬ 
structed  by  means  of  multi-step  proximal  regularization  (only  w.r.t.  the  control  functions) 
in  the  penalized  problems.  For  distributed  control  problems  with  state  constraints  conver¬ 
gence  of  the  approximately  determined  solutions  of  the  regularized  problems  to  an  optimal 
process  is  proved. 
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uted  control,  penalty  methods. 


1.  Introduction 

In  this  paper  the  proximal  point  approach  coupled  with  the  penalty  technique 
is  developed  for  solving  ill-posed  convex  parabolic  control  problems  with  state  con¬ 
straints.  The  investigation  is  concentrated  on  problems  governed  by  linear  parabolic 
equations,  the  objective  functional  and  the  sets  of  admissible  controls  and  states  are 
assumed  to  be  convex. 

Usually,  convergence  of  numerical  methods  for  such  problems  is  studied  under  the 
additional  assumption  that  the  objective  functional  is  strictly  (or  strongly)  convex 
w.r.t.  the  control,  or  that  the  optimal  control  possesses  the  bang-bang  property. 
We  refer  here  to  Alt  and  Mackenroth  (1989),  Glashoff  and  Sachs  (1977), 
Hackbusch  and  Will  (1984),  Knowles  (1982),  Lasiecka  (1980,  1984),  Mack¬ 
enroth  (1982-83,  1987),  Malanowski  (1981),  Troeltzsch  (1987). 

The  first  results,  connected  with  the  use  of  the  penalty  technique  for  control  problems 
are  obtained  by  Lions  (1968)  and  Balakrishnan  (1968  A,B),  for  further  applica¬ 
tions  see  Lions  (1985).  Penalization  of  the  state  equation,  permits  to  handle  with 
control  and  state  variables  as  independent  ones.  In  Bergounioux  (1992,  1994),  for 
convex  elliptic  and  parabolic  control  problems  with  state  constraints,  penalty  meth¬ 
ods  have  been  used  in  order  to  prove  the  existence  of  Lagrange  multipliers  under 
weak  qualification  hypothesises. 
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In  all  these  investigations  strong  convexity  of  the  objective  functional  was  one  of  the 
essential  conditions. 

The  paper  presented  here  deals  with  convex  parabolic  control  problems  without  addi¬ 
tional  assumptions  mentioned  above.  So,  the  problem  may  be  non-uniquely  solvable, 
moreover,  we  don’t  exclude  that  the  set  of  optimal  controls  can  be  unbounded.  Using 
the  scheme  of  multi-step  regularization  developed  in  Kaplan  and  Tichatschke 
(1994)  for  abstract  convex  variational  problems,  a  partial  proximal  regularization 
(w.r.t.  the  control  only)  of  the  family  of  penalized  problems  is  performed.  This  per¬ 
mits  to  handle  with  well-posed  auxiliary  problems  and  to  ensure  weak  convergence 
of  their  approximately  determined  solutions  to  an  optimal  process,  as  well  as  conver¬ 
gence  of  the  corresponding  values  of  the  objective  functional  to  the  optimal  value  of 
the  original  problem. 

For  convex  elliptic  control  problems  an  analogous  approach  has  been  realized  in 
Hettich,  Kaplan  and  Tichatschke  (1994,  1996).  In  the  last  two  decades  prox¬ 
imal  point  technique  is  sucessfully  developed  for  solving  variational  inequalities  with 
monotonous  operators,  including  convex  optimization  problems  and  saddle-point 
problems.  Eckstein  and  Bertsekas  (1992)  have  shown  a  relationship  between 
the  proximal  point  method  and  the  Douglas-Rachford  splitting  method,  pointing  out 
new  application  fields,  especially  in  mathematical  physics.  Nevertheless,  besides  the 
papers  mentioned  here,  we  don’t  know  publications,  where  proximal  point  technique 
was  applied  to  control  problems. 

2.  Formulation  of  the  control  problem 

Let  Q  C  Mn  be  a  bounded  domain  with  a  boundary  dQ,  of  the  class  C2,  be  locally 
situated  on  one  side  of  dQ,  and 

Q  =  Q  x  ]0,T[,  £  =  x  ]0,T[. 

In  the  sequel  we  use  the  following  notation  for  functional  spaces: 

L2(0,T',Z)  -  space  of  functions  with  range  in  a  Hilbert  space  Z ,  square  integrable 
on  (0,T), 

|M|l2(0,T;Z)  =  (^Jq  llw(f)lll^  i 

||  •  ||o,q  -  norm  of  an  element  in  L2(0,T;  L2{p))\ 

C([0,T];  Z)  -  space  of  continuous  functions  on  [0,T]  with  range  in  Z , 

IMIc,([o,7’];z)  =  'maxo<t<T\[v(t)\\z\ 

Hs{ fi),  Hs0{Vt)  -  standard  Sobolev  spaces,  L2(Q)  =  H°{ ft),  ||  •  ||s,n  -  norm  in  tfs(ft); 
II  •  ||o,s,n  ~~  norm  in  L2(0,T]  HS(Q))  for  s  >  1; 

(•,  -)o  -  inner  product  in  L2(Cl)\ 

X  •—*  H  -  continuous  embedding  of  the  space  X  into  H. 

We  consider  the  parabolic  equation 

dv 

~(x,t)  +  Ay(x,t)  =u(x,t)  a.  e.  in  Q, 


(2.1) 
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y(x,  0)  =  0  in  fl, 


(2.3)  y(x,  t)  =  0  on  E, 
where  the  elliptic  operator  A  is  given  by 

n  Q  (  Qy  \ 

(2.4)  Ay  =  -  —  \  aij(x,t)—J  +  a0(x,t)y, 

with  a,ij  £  C2(Q),  a0  G  C2(Q )  such  that  for  all  ( x ,  t)  G  Q,  £  G  M"  and  some  d0  >  0 

n  n 

(2.5)  a0(x,t)  >0  and  aij(x,t)&j  >  doYl  &■ 

i,j= 1  i= 1 

For  each  u  £  U  =  1,2(0, T;  L2(Q))  Problem  (2.1)-(2.3)  is  uniquely  solvable,  and  its 
solution  yu  belongs  to 

(2.6)  fF  =  {yGJL2(0,T;i/2(n)n//01(fi))  :  ^  £L2(0,T;  L2(Cl)),y(x,0)  =  0  in  Q} 

(see,  for  instance  Lions  and  Magenes  (1968),  vol.  1.).  The  space  W  endowed  by 
the  norm 

(2-7)  IMk  =  (lMlo,2,n+  ) 


is  a  Hilbert  space.  Moreover,  (ibid.,  Theorem  1.3.1) 

W  c-*  C([0,T];  Hq(Q)), 

and  the  operator  T  :  Tu  =  yu  is  continuous  as  a  mapping  from  £2(0,  T ;  L2(Q))  into 
C'([0,T];Hr01(Q))  (see  Lions  and  Magenes  (1968),  vol.  2.). 

In  order  to  formulate  the  control  problem  we  introduce  the  space 

(2.8)  Y  =  {y  G  L2( 0,  T;  H^Q))  :  ~  +  Ay  £  L2( 0,  T;  L2(ft)),  y(x,  0)  =  0  in  Q} 

which  coincides  algebraically  with  W:  Indeed,  regarding  the  smoothness  of  Oy,Oo, 
the  inclusion  IF  C  F  is  obvious,  and  the  inclusion  Y  C  W  is  a  consequence  of  the 
fact  that  yu  G  W  for  each  u  £  U. 

Using  the  inequality 

\\v\\w  <  c\\y\\Y  Yy  G  Y, 

which  follows  from  the  Lp-estimates  for  the  solutions  of  parabolic  equations  (see 
Ladyshenskaja,  Solonnikov  and  Ural’zewa  (1968),  Theorem  4.9.1),  one  can 
easily  show  that  the  space  Y  with  the  norm 

(2-9)  \\y\\Y  =  dJ  +  Ay 

ai  0  ,Q 

is  a  Hilbert  space,  too;  moreover, 

Y^C^Tj-H^Q)). 

The  approach  suggested  will  be  presented  with  the  following  model  problem 
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Problem  ( P )  : 

minimize  J{u)  =  jlr/^T)  —  2/rf|lo,n  subject  to  u  E  Uad,  yu  E  G, 

where  Uad  and  G  are  convex  and  closed  sets  in  the  spaces  L2(0,T]  L2(£l))  and  Y, 
respectively;  yd  E  L2(£l)  is  a  given  function  and  it  is  supposed  that  {u  E  Uad  :  yu  G 
6’}  =/  0- 

Due  to  the  continuity  of  the  mapping 

T:  L2(0,T;L2(n))  —  C(\Q,T]\ Hq(Q.)) 

the  functional  J  is  continuous  -on  L2(0,  T-,L2(£l)).  Therefore,  if  Uad  is  bounded, 
Problem  (. P )  is  solvable,  but  in  general,  non-uniquely  solvable.  If  Uad  is  unbounded, 
it  may  happen  that  the  set  of  optimal  controls  is  empty  or  unbounded. 

We  introduce  the  space 

H  =  Pxi2(o,r;i2(fi)), 

endowed  by  the  natural  norm:  Tor  z  —  ( y ,  u )  with  y  eY,  u  E  L2{0,T-  L2{Q)), 

(2.10)  Mh  =  (tb'lly  +  IMIo,<?)1/2  • 

3.  Regularized  penalty  method  (RP-method) 

Method  (Multi-step  regularization) 

Let  {r*},  {e2},  {xi}-,  and  {Si}  be  positive  sequences  with 

linii^ri  =  0,  supi  r*  <  1,  lim^o o—  =  0,  supt  Xi  <  2, 

Xi 

and  u°  G  Uad- 

Step  i:  Given  u1"1  E  Uad- 

a)  Set  u 1,0  :=  w“_1,  s  :=  1. 

b)  Given  u M"1,  let 

(3.1)  (yi’s,ut’s)  =  argmin{^iiS(r/,n)  :  (y,u)EG  x  Uad } 
with 

(3.2)  Vi'siy,  u)  =  \\y(T)  -  j/d||o,fi  +  —  ~^  +  Ay~ 

Compute  an  approximation  ( yl’s,u1,s )  G  G  x  Uad  of  (yl’s,ul,s)  such  that 

(3.3)  |(j/i's,wi’s)-(yi's,^s)||=<  X 

w  Xi 

c)  If  ||u‘>s  -  «i,s_1||0,Q  >  Si,  set  s  :=  s  +  1  and  repeat  b). 

Otherwise,  set  id  :=  s(i)  :=s,i:=i  +  1,  and  continue  with  Step  ( i  +  1). 

Of  course,  the  stopping  rule  (3.3)  is  not  yet  practicable.  But,  as  it  will  be  shown 
below,  the  functional  is  strongly  convex  on  5.  This  usually  permits  to  satisfy 
(3.3)  by  means  of  a  stopping  criterion  of  an  algorithm,  minimizing  on  G  x  Uad- 


+  §  u  -  u1’*-1  2  . 
2  0  ,Q 
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4.  Convergence  of  the  RP-method 

For  shortness,  in  the  sequel  we  will  use  the  following  abbreviations: 

(4.1)  z  =  (y, U),  z*  =  (y*,u*),  ?’s  =  #s)  etc. 

Let 

(4.2)  \z\=^^  +  Ay-u  +Wo 


Lemma  4.2.  The  functional  4qiS  is  continuous  and  strongly  convex  on  E. 

Proof.  Due  to  Lemma  4.1,  continuity  of  || y(T)  —  yd\\o,n  on  E  ensures  continuity  of 
s .  Now,  let  us  prove  strong  convexity.  To  this  end,  we  rewrite  the  functional  as 
follows: 


^i,s(y’u) 


(4.4) 


IWr)-rf«+(i-|)  t  +  Av~u  w 

+  Xl  dJt+  Ay -A  +^|L_„..-f 

2  dt  y  QQ  2  II  Ho, <3 

HT)-yd\\l n+(^-f)  t  +  Ay~u  qq 

+§  ii^iio, o  s:  («w,  +  §  \m\2- 


Because  of  r»  <  1,  x»  <  2  the  term  in  the  square  brackets  is  a  quadratic  functional  with 
a  non-negative  quadratic  term  in  (y,u),  hence,  it  is  a  convex  functional.  Therefore, 
taking  into  account  Lemma  4.1,  4qiS  is  strongly  convex  on  the  space  E  with  the  norm 
||  •  ||=  or  |  •  j.  □ 
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The  following  result  is  an  analogon  of  Lemma  22.3  in  Kaplan  and  Tichatschke 
(1994).  Let  Zbea  Hilbert  space  with  an  inner  product  ((•,  -))z  and  a  norm  ||  •  \\z ;  Z\ 
be  a  closed  subspace  of  Z  and  V  :  Z  ->  Zx  be  the  orthogonal  projection  operator. 
We  consider  the  problem 

(4.5)  minimize  4>(,z)  =  a(z,  z)  —  i(z)  subject  to  z  £  K, 

where  a(-,  •)  is  a  continuous,  symmetric  and  positive  semi-definite  bilinear  form  on 
Z  x  Z,  i  is  a  linear,  continuous  functional  on  Z  and  K  C  Z  is  a  convex,  closed  set. 
Further,  suppose  that  b(-,  •)  is  a  second  symmetric  bilinear  form  on  Z  x  Z  such  that 

(4.6)  0  <  b(z,  z)  <  a(z ,  z)  for  z  &  Z, 
and,  with  some  (3  >  0, 

(4.7)  b(z,  z)  +  \\Vz\W  >  p\\z\\%  for  all  z  £  Z. 

By 

(4.8)  \z\l  =  b{z,z)  +  \\Vz\\2z 

another  norm  is  defined  on  Z,  which  is  equivalent  to  ||  ■  \\z  according  to  the  obvious 
relation 

{M+l)\\z\\%>\z\l>(3\\z\?z 

with  M  >  sup^o  Tm#- 


Lemma  4.3.  For  each  a0  £  Z  and 


(4.9) 

a1  =  arg  min  {4>(^)  +  p.\\Vz 
z 

-VaXz 

(x  e  (0, 2]  is 

kept  fixed)  the  following  inequalities  are  true  for  all  z  £  K: 

(4.10)  j 

la1  -  z\%  -  |a°  -  z\%  <  -HPo1  -  Va° |||  +  h*{z)  -  ^(a1)] 

and 

(4.11) 

la1  —  z\z  <  |a°  —  z\ 

z  +  r)(z), 

with 

v{z)  _  {  [!(*(*) -^(a1))] 1/2 

if$(z)  >  $(ax) 

1  0 

otherwise 

If,  moreover, 

|| Vax  —  Va°\\z  >  6  >  T}(z),  then 

(4.12) 

| a1  -  z\z  <  | a0  -  z\z  + 

rf(z)  —  <52 

2|a°  -  z\z 

Now  we  come  back  to  the  control  problem. 

•  The  case  of  a  bounded  set  Uad 

Assume  there  exists  a  point  u  £  Uad  such  that 


ya  =  Tu  G  int  G  (in  Y). 


A.  Kaplan  and  R.  Tichatschke 


175 


Lemma  4.4.  Let  ( y*,u *)  be  an  optimal  process  of  Problem  (P),  v  G  (0,  |)  be  an  ar¬ 
bitrary  number.  Suppose  that  u*’5-1  G  Uaci  is  arbitrarily  chosen  and  (yhs,ul's)  and 
(yi’s,ui’s)  are  defined  by  (3.1)  and  (3.3)  with  this  u Then  there  exist  two  con¬ 
stants  d(v)  and  di,  independent  of  i,  s  >  1,  {e^},  {r,},  and  {xi},  such  that 

(4.13)  Ji(y*,u*)  -  Mtf’f  vhs)  <  d(v)r\-2'' 
and 

(4.14) 
with 

(4.15) 


Ji{y,u)  =  ||  y(T) 


\(yi’°,ui’s)-(y*,u*)\<d1, 
dy 


1 1 0,0 


+ 


1 


dt 


+  Ay  —  u 


o  ,Q 


Underline  that  |  •  |  is  defined  by  (4.2),  and  the  controlling  sequences  {r*},  {e,},  and 
{Xi}  are  chosen  according  to  the  RP-method. 

Proof.  The  existence  of  the  points  (y*’s,u*’s)  and  (yl's,ul's)  is  guaranteed  by  Lemma 
4.2.  Now,  we  introduce  the  following  notation: 

yi,s  =  Tv)'3,  z's  =  (yi’a,&’8)  ,  z(u)  =  (Tu, u), 

Tmin  =  min  ||Tu  -  w||y,  Tmax  =  max  II z(u)  -  z(u)||H, 

wedG  'u€.Uad 

■wi,s  =  arg  mmveGxUad  \\z's  -  n||_  . 

Note  that  rmax  >  Tmin  if  {Tu  :  u  G  Uad}  H  dG  ^  0.  In  case  of  yi>s  £  G  we  define  the 
points 

hl,s  G  {z(u)  +  A  (z%'s  -  z(u ))  :  A  >  o}  n  {dG  x  Uad}  , 
and  (if  hh,s  ^  wt,s) 

k*'3  G  {z(u)  +  A  (z’a  -  w i>s)  :  A  >  0}  fl  {h*'s  +  y  (/iM  -  u/’s)  y  >  o}  . 
Obviously,  the  points  hz’s  and  kl's  are  uniquely  determined,  and 


\\z  —  w 


Li 

\\z  —  h l' 


||fc*>s  —  z(«)||H  ||/ii>s  —  z(u)\\s 

Due  to  the  trivial  implication  —  1  ^  |  => 

\z’s  -  z{u)\\e 


we  obtain 


1  —t i  i  < 

\\z  -  h v 


\\kbs  —  z(u)  ||E+  z's  —  wL- 


I  .  S  7  c 

\\z  -W’ 


„  'max 

<  -  \\Z  —  W 


(4.16) 

In  the  standard  manner  the  Gateaux-differentiability  of  the  functional 

l|2 


dy_ 

dt 


+  Ay-  u 


o  ,<? 


T  „  11“  u  llo.Q 
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in  the  space  S  with  the  norm  |  ■  |  can  be  established.  Regarding  the  definition 
of  {yl's,ul's),  we  obtain  by  means  of  Proposition  II. 2. 2  in  Ekeland  and  Temam 
(1976)  that,  for  all  (y,  u)  G  G  x  Uad, 


(4.17) 


MT)-yd\\ln-\\ts{T)-ydfQ 
+  2  fT  ( dyi,s(t) 

n  Jo 

dy(t) 


dt 


+  V(t)-«M(t), 


+  Ay(t)  -  u(t)  -  -  At\t)  +  vT(t) )  dt 


dt  '  "*v''  “v”'  dt 

rT 


+Xi  f0  -  rf's  l{t)  ,  u(t)  -  wi,s(i))n  dt  >  0. 


Setting  y  =  y*,  u  =  u*  in  (4.17),  in  view  of  ^  +  Ay*  -  u*  =  0  and  the  obvious 


inequality 


(4.18)  u-tf'S-1  2  u*1-1  * 

V  ’  0,Q  0,Q 


>2  J  (ui,s(t)  -  u’,s  *(f)  ,  u(t)  -  uM(t))Qdt, 


one  can  conclude  that 


(4.19) 


dy1 


dt 


+  Afs  -  vJ 


0,Q 


Thus, 


(4.20)  ||r*t < (D^^b-ui-wii^+l |«>  - »‘-ta)'/!+|[«‘1M ■ 


Now,  regarding  the  boundedness  of  Uad,  iq  <  1,  x*  <  2,  lim 


Li— foo  ^  0)  'LLLlI 


condition  (3.3),  inequality  (4.20)  yields 


(4.21) 


W’sL<ci,  bl’sL<ci 


(all  the  constants  c*,  in  this  proof  don’t  depend  on  ( i,s )).  Estimate  (4.14)  follows 
immediately  from  (4.21)  and  the  equivalence  of  the  norms  ||  ■  ||s  and  |  •  |. 

Due  to  (4.21)  and 


b\\w  ^  c\\v\\y  for  all  yeY, 

one  gets 
(4.22) 

Inequality  (4.19)  ensures  also  that 


\f’s  w  <  c2>  yl's  w  <  c2  with  c2  =  cci. 


(4.23) 


dyl 


dt 


■  +  Ayi,s  -  # 


.  1/2 
<  c3ry  . 


lo,<3 


A.  Kaplan  and  R.  Tichatschke 


177 


Taking  into  account  that  +  AyhS  -  ul's  =  0,  this  leads  to 

(4.24)  \ts-Vi's\Y<czr\12 

and,  due  to  \\yl's  -  yhS\\Y  —  ||^’s  -  >  we  obtain 

(4.25)  |zi,s-li’s|=  <c3r1i/2. 

Because  zl,s  G  G  x  Uact,  the  estimate 

(4.26)  ||wJ,s  —  t',4|_  <  c3r^2 

follows  from  (4.25)  and  from  the  definition  of  wt,s. 

Denote  by 


Zf  =  {z  =  (Tu,u)  :  zeGxUad} 

the  set  of  feasible  processes  and 

F’s  =  (f's,S<lS)  =  arg  min  {||F'S  -  z\\s  :  z  6  Zfj . 

If  yl,s  G  and  wl's  ^  hl,s,  then  on  account  of  hl's  G  Zf,  we  obtain  from  (4.16),  (4.25) 
and  (4.26)  that 

||F’s-im||=  <  ||F’s  - 

(4.27)  <  ||F’S  -  fM||=  +  1 1*’4  -  h%  <  +  l)  czry\ 

If  yi,s  G ,  but  wi>s  =  hi,s ,  estimate  (4.27)  follows  from  (4.25),  (4.26)  and  Tmax  >  rTOjn. 
In  case  yt,s  G  G,  the  inequality 


(4.28) 


\\zhS  -  zl,s  I  <  c3r. 


1/2 


is  an  immediate  consequence  of  (4.25). 

Inserting  into  relation  (4.17)  y  =  yhS ,  u  =  Ul,s ,  one  gets 

l|2 


\\yi’s(T)-yd\\ln  +  ~ 


dy *’ 


dt 


+  Af's  -  u1’ 


0  ,<? 


<  ^(T)-yd\\on  +  Xi  u*’s-u  ]iQQ 


*•*11  llyM_uM-l| 


llo, Q  • 


and  hence, 


dy1' 


2  II  WiTS.S 

n 


dt 


■  Afs  -  u1' 


< 


0  ,Q 


|r>(T)  _  ^(r)|on  ||rs(r)  +  f°{T) 


(4.29) 


+  Xi  fj'S  -  u^-u*’*-1 

o  ,Q 


o  ,Q 


Because  of  Y  c~*  <17( [0,  T1] ;  i?o (^) ) 5  (4-22),  (4.27),  <  1,  Xi  <  2  and  the  boundedness 

of  Uad,  inequality  (4.29)  leads  to 


(4.30) 


dy1' 


dt 


+  Af's  -  u1' 


<  Wi 


3/4 


0  ,Q 
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Using  (4.30)  instead  of  (4.23),  the  estimates  (4.24)-(4.26)  can  be  improved  (w.r.t.  the 
order)  and  we  obtain 


s  a.i,s  ^  3/4 

z  '  —  z  „  <  c4r/  , 


-  f  ’  L  < 


Thus,  similar  to  (4.27),  (4.28),  the  inequality 


•  F’i  <  (—  +  l) 

“  \7~7nin  / 


can  be  established. 

A  multiple  repetition  of  this  operation  (using  in  each  step  the  current  estimates) 
leads  to  the  conclusion  that,  with  arbitrarily  fixed  v  e  (0,  |)  and  some  constant  c(v), 
the  estimates 

(4.33)  +  Ats  -  <  c{v)r\~2\ 

0  ,Q 


(4.34) 


_  |v||  <  f7™?.  +  i')  c(v)rj~ 

a  \  T-min  ' 


are  valid  uniformly  w.r.t.  ( i ,  s).  Now,  from  the  obvious  equality 

Ji(z*)  -  Ji  (^)  =  II y*(T)  -  ytWlfi  -  | f’s(T)  -  yd(on  +  ||r  (T)  -  yd ||’n 

ii  1 1 2  1  dv^,s 

-  pr+Air"-a'\,a’ 

due  to  ||j/*(T)  -yj0|n  <  \tf' S{T)  -  yd\\Q (4.21),  (4.34),  and  the  embedding  Y 
C([01T];F01(^)))weget 

Ji(z*)  -  Ji  (V's)  <  d{y)r\~2\ 

with  d(v)  independent  from  (i,s),  i.e.  estimate  (4.13)  is  true.  □ 

Theorem  4.1.  Assume  that  Uad  is  a  bounded  set  and  Tu  €  intG  for  some  u  G  Uad; 
that  v  G  (0,  |)  is  a  fixed  number  and  that  constants  d{y),  d\  are  defined  according 
to  Lemma  4.4.  Let  the  positive  sequences  {n},  {e,},  {Xi}>  and  {^i}  in  the  RP-method 
satisfy  the  conditions 

oo  r}l2~v  oo  £ 

(4.35)  suprj  <  1,  supxi  <  2,  £-  -*  x,2  <  oo,  —  <  oo 

i  i  i=l  X.i  ®=1  ™ 


2 ihVtT  +^<0' 
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Then,  for  any  starting  point  u°  6  Uad,  the  RP-method  is  well-defined,  i.e.  s(i)  <  oo 
for  each  i,  and  {til,s},  {yl,s}  converge  weakly  in  L2(Q),Y  to  u,  y  respectively,  where 
( y,u )  is  an  optimal  process  for  Problem  ( P ). 

Proof.  Let  us  assume  that  s(i)  <  oo  for  i  =  1, . . . ,  k  —  1.  Then,  starting  in  step  i  =  k 
with  uk~l  —  due  to  the  definition  of  s(i),  (3.3)  and  (4.36),  we  conclude 

|UM_uM-i||  >  |Lm_um-i||  -  \\uk’s  -  uk’s\\ 

II  llo.Q  ~  II  llo.Q  II  IIO.Q 

>  6k  —  —  >  0  for  1  <  s  <  s(k). 

Xk 

Together  with  inequality  (4.13)  and 

2d{v)r^-  -  Uk  -  ^ )  <0 

Xk  V  Xk) 

(cf.  (4.36)),  this  implies 

(4.37)  \\uk'a  -  >  —  \jk{z*)  -  Jk(zk,s)  1  for  1  <  s  <  s(k). 

II  llo.Q  Xk  L  J 

Let  z1'0  =  (T u°,u°).  Applying  (4.13)  and  Lemma  4.3  with 

Z  =  E,  Zi  =  {z  =  (y,u)  e  E  :  y  =  0},  $  =  <4, 


«(z,  z)  =  (y(T),y(T))a+f  +  Ay(t)  -  «((),  ^  +  Ay(t)  ~ 

b(z.  z)  =  ff  +  -49(f)  -  «(()  ,  'ff  +  Ay(t )  -  i(t)j  it , 

e(z)  =  2(y(T),yd)n,  K  =  G  x  Uad,  of)  =  zk’s~1,  x  =  Xk,  and  6  =  6k-^-, 

Xk 

we  obtain  from  (4.12)  and  (4.14) 

(4.38)  \zk’s-z*\ 

<  -  z*\  +  X-  \2d(u)  — - (sk  -  — )  ]  for  1  <  s  <  s(k). 

1  >2 di  y  Xk  V  Xk)  ) 

Using  (3.3),  (4.3)  and  (4.36),  inequality  (4.38)  yields 

(4.39)  \zk,s  —  z*\  —  1^,5— i  —  z* 


„l-2i/ 


<  2*  Vd[vy 


Xk 


t>k  ~ 


+  ^  <  0. 

Xk 
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Inequality  (4.39)  proves  that  s(k)  <  oo,  because  the  middle  term  in  (4.39)  is  inde- 
pendend  from  s. 

Now,  for  s  =  s(k),  the  use  of  Lemma  4.3  with  the  same  data  as  above,  leads  to 


I  — fc,s(fc) 


< 


Lfc,s(fc)-i  _ 


„l-2i/ 


2  d(v)- 


Xk 


hence, 


(4.40) 


fc,s(fc) 


< 


\zkAk)-i  _  z* 


+ 


„l-2i/ 


2 d{v)- 


Xk 


V3— . 

Xk 


Taking  into  account  that  the  finiteness  of  s(l)  can  be  proved  quite  analogously,  we 
infer  that 


s(i)  <  oo  for  each  i, 

and  the  inequalities  (4.39)  and  (4.40)  are  valid  for  each  k. 

In  view  of  (4.39),  (4.40)  and  (4.35),  Lemma  2.2.2  from  Polyak  (1987)  ensures  the 
convergence  of  the  sequence  {| zhS  —  z* |},  and  with  regard  to  (3.3),  (4.3)  and  the  last 
inequality  in  (4.35),  the  sequence  {| zl’s  -  z* |}  converges  to  the  same  limit. 

Suppose  that  {zik’Sk},  with  s*.  >  0  for  each  k,  converges  weakly  to  z  —  ( y,u )  G  S. 
Due  to  (4.34),  (4.35),  converges  weakly  to  z,  too.  Observing  the  convexity 

and  the  closedness  of  Zf  and  that  C  Zf,  we  conclude  that  z  G  Zf. 

But  Lemma  4.3  yields  also 

-z* f  -  l^-1  -  z*f  <  —  \ji(z*)  -  Ji  , 

I  II  1  Xi  L  v  n 

and  by  definition  of  ./,  (cf  (4.15)) 

Ji(z*)  =  \\y*(T)  -  UdWln  =  Jitf'3)  >  1  j/M(T)  -  ydfoa, 


hence, 


\Z"°  -z  -  z1 


i.S—  1 


,2  2 
<  — 
1  Xi 


J(0-  ts(T)-yd 


o,n 


Due  to  the  convexity  and  the  continuity  (in  Y),  the  functional  || y(T)  —  2/dilo,n 
weakly  lower  semi-continuous.  Taking  limit  in  the  last  inequality  w.r.t.  the  subse¬ 
quence  {zlk’Sk},  we  obtain 

J(u*)  >  ||y(T)  - 


II2 

Ho, 12  > 


hence,  z  is  an  optimal  process.  Finally,  Lemma  4.1  in  Opial  (1967)  ensures  weak 
convergence  of  both  {zhS}  and  {zl’s}  to  z  G  S.  □ 

•  The  case  of  unbounded  set  Uad 

Now,  we  formulate  convergence  results  for  the  case  of  an  unbounded  set  Uad  and 
G  =  Y.  Hereby  solvability  of  Problem  ( P )  is  assumed.  Let  us  choose 

CO  || V  CO  —  2/d|lo,S2  >  Pi,s- 1  ^  0- 
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Let 


(4.41)  Ul'S  1  G  {ueUad  :  ||u  -  U*||o,g  <  Pi,s- 1} 

be  arbitrarily  fixed,  where  2:*  =  ( y*,u *)  is  an  optimal  process.  Suppose  that  the 
points  (yhS,uhS)  and  ( yl’s,ul’s )  are  defined  by  (3.1)  and  (3.3)  with  this 

Lemma  4.5.  Let  the  sequences  {r;},  {<:,},  {y,}  be  chosen  according  to  the  RP-method 
and  v  G  (0,  be  an  arbitrary  number.  Then  there  exists  a  constant  d( v),  independent 
from  i,  s  >  1,  {?y},  {e,},  {xi}  and  ut,s-1  in  (4.41 ),  such  that 

(4.42)  Ji(y*,u*)  -  Ji  ( yhS ,  uhS^j  <  d(y )  (co  +  pi,s-\f  r]~2v. 

Theorem  4.2.  Let  u°  G  Uad,  .z1,0  =  ( Tu°,u° ).  With  pi  >  Hz1,0  —  z*||  the  sequence 
{pi}  let  be  defined  recursively  by 


(4.43) 


Pi+l  —  Pi  + 


N 


Xi  Xi 


Moreover,  assume  that  the  controlling  parameters  in  the  RP-method  satisfy  the  con¬ 
ditions  (4.35)  and,  for  each  i,  let 


(4.44)  -L  ( 2d»  (c0  +  Pi)2  ^  -  (St  -  ^ ))  +  <  0,  6t  >  X 

2pi  ^  Xi  V  Xi)  )  Xi  Xi 

Then,  applying  the  RP-method  to  Problem  (P)  with  G  =  Y,  one  gets  s(i)  <  00  for 
each  i,  and  {uv},  {yM}  converge  weakly  in  L2(Q),  Y  tou,y  respectively,  where  (y,  u) 
is  an  optimal  process  for  Problem  (P). 


Note  that,  due  to  (4.35)  and  Lemma  2.2.2  in  Polyak  (1987),  the  sequence  {pi}  is 
convergent. 

The  conditions  of  the  Theorems  4.1  and  4.2  permit  a  slow  change  of  the  controlling 
parameters  rj,  and  Xi  :  For  instance,  it  is  possible  to  take 

0  <  X  <  Xi  <  2  and  r»  =  q\;  Ci  =  qz2  with  arbitrary  q\,q2  G  (0, 1), 

and  then  to  choose  Si  according  to  (4.36)  or  (4.43),  (4.44).  However,  the  calculation 
of  d(y)  or  d{y)  may  be  difficult. 

There  are  no  principal  difficulties  to  extend  this  consideration  to  other  objective 
functions  of  the  form  J(u)  =  \\Cyu  —  yd\\n,  where  TL  is  a  Hilbert  space  (on  fl,  Q  or 
S),  C  G  C(Y,H)  and  yd  G  H. 
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Abstract.  J.-L.  Lions  proved  several  observability  theorems  for  coupled  linear  distributed 
systems  provided  the  coupling  parameters  are  small  enough.  It  remained  an  open  question 
whether  the  assumption  on  the  smallness  of  these  parameters  is  necessary  for  the  validity  of 
the  results.  Using  nonharmonic  analysis  the  first  two  authors  proved  recently  that  in  some 
cases  the  observability  holds  in  fact  for  almost  all  values  of  the  parameters  if  the  underlying 
domain  is  an  open  ball.  Combining  this  method  with  a  compactness-uniqueness  method 
developed  by  the  third  author  we  extend  these  results  for  all  sufficiently  regular  bounded 
domains. 

1991  Mathematics  Subject  Classification.  35L05,  35Q72,  93B05,  93B07,  93C20,  93D15 

Key  words  and  phrases,  observability,  controllability,  stabilizability  by  feedback,  partial 
differential  equation,  wave  equation,  Petrovsky  system. 


1.  Introduction 

Let  fi  be  a  bounded  open  domain  of  class  C 4  in  Rn.  We  shall  denote  by  v  the 
outward  unit  normal  vector  to  its  boundary  T.  Given  two  real  numbers  a  and  b, 
consider  the  following  system: 


(1.1) 


u"  —  Aui  +  bu2  =  fi  in  R  x  fi, 
v!<2  +  A  2u2  +  aui  —  /2  in  IxSl, 

<  Ui  =  u2  =  A u2  =  0  on  R  x  T, 
«i(0)=Mio  and  ^(O)  =  «n  in  fi, 
u2(0)  =  W20  and  *4(0)  =  w2i  in  fi. 


One  can  prove  by  standard  methods  that  for  any  given  functions  /1  G  L)oc(R;  L2(fi)), 
h  G  Ljoc(R;  H~l  (fl)) ,  and  initial  data 


(uio,uu,u2o,u2i)  G  Hq(Q.)  x  L2(fi)  x  //,|(fi)  x  H  ^fi), 
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this  problem  has  a  unique  weak  solution  satisfying 

«i  G  C(R;  H%(Si))  n  C^R;  L2(ft)) 

and 

u2  G  C(R;  H^Q))  n  <7X(M;  /T^ft)). 

We  define  the  initial  energy  of  the  solutions  by  the  formula 

E0  =  ||wio||lfi(n)  +  llun  111,2(0)  +  llu2o|llr0i(n)  +  ll«2ill#-i(n)- 

Let  us  first  assume  that  the  system  (1.1)  is  uncoupled,  i.e.  a  =  b  =  0.  Then  it 
follows  from  some  results  of  [11],  [12]  (p.  44,  theoreme  4.1,  p.  287,  theoreme  4.1)  that 
for  every  bounded  interval  I  containing  0  the  solutions  of  (1.1)  satisfy  the  estimates 

(1.2)  J^(d„ui)2  +  (d„u2)2  dT  dt  <  CiEn+  Ci||/i||/J2(j.i2(n))  +  Ci\\f2\\2L2^.H-i^) 

with  a  constant  cv  Here  and  in  the  sequel  all  constants  are  assumed  to  be  independent 
of  the  particular  choice  of  the  initial  data. 

Now  fix  two  open  subsets  Tj,  F2  of  F  and  a  positive  number  To  such  that  for  any 
bounded  intervals  fy,  I2  of  length  |/i|  >  To  and  \I2\  >  0,  in  case  a  =  b  =  0  and 
/x  =  /2  =  0  the  solutions  of  (1.1)  also  satisfy  the  inverse  inequalities 

(1.3)  E0  <  c2  f  [  ( duUl )2  drdt  +  c2  [  [  {d„u2)2  dT  dt 

Jl\  JTi  J I2 

with  a  suitable  constant  c2. 

The  purpose  of  this  paper  is  to  show  that  then  analogous  estimates  hold  for  the 
coupled  system,  too,  at  least  for  almost  all  choices  of  the  coupling  parameters  a  and  b. 

Remarks.  According  to  earlier  results  of  Lions,  Zuazua  and  Komornik  [11],  [12]  (p. 
55,  theoreme  5.1,  p.  296,  theoreme  4.3),  [13]  (pp.  474-478),  [5]  (p.  82,  theorem  6.11) 
on  the  wave  equation  and  on  Petrovsky  systems,  the  estimates  (1.3)  are  satisfied  for 
example  if  there  exists  an  open  ball  B(x\\Ri)  containing  fl  and  a  point  x2  G  R"  such 
that 

(x  -  xi)  ■  u(x)  <  0  on  r  —  Fi, 

(x  —  x2)  ■  v(x)  <  0  on  r-r2, 

and  if  the  lengths  of  Iu  I2  satisfy  \h\  >  2i?i  and  |/2|  >  0.  Let  us  emphasize  the  fact 
that  I2  can  be  arbitrarily  small:  this  is  due  to  the  infinite  propagation  speed  for  the 
Petrovsky  system. 

If  ft  is  of  class  67°°,  then  much  weaker  sufficient  conditions  were  obtained  by  Bardos, 
Lebeau  and  Rauch  [1],  [2],  [10]:  every  ray  of  geometric  optics  in  ft  meets  r:  x  fy  at 
some  nondiffractive  point,  and  there  exists  a  bounded  interval  /2,  possibly  longer  than 
I2,  such  that  every  ray  of  geometric  optics  in  ft  meets  r2  x  I’2  at  some  nondiffractive 
point.  Note  again  that  there  is  no  assumption  on  the  length  of  J2:  it  can  be  arbitrarily 
small. 

See  also  Burq  [3]  for  an  extension  of  the  results  of  [1]  and  [2]  to  the  case  where  ft 
is  only  of  class  C3. 

Our  main  result  is  the 
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Theorem  1.1.  (a)  Given  two  real  numbers  a,b  arbitrarily  and  a  bounded  interval  I 
containing  0,  in  case  /4  =  /2  =  0  the  solutions  of  (1.1)  satisfy  the  estimates 

(1.4)  J  J^{dvui)2  +  {dvu2)2  dF  dt  <  c3E0 
with  some  constant  c3 . 

(b)  For  almost  all  choices  of  the  pair  (a,  b)  €  R2,  if  1 1,  I2  are  two  bounded  intervals 
satisfying  |/i|  >  2 Ri  and  |/2|  >  0,  then  in  case  /i  =  f2  =  0  the  solutions  of  (1.1) 
satisfy  the  estimates 

(1.5)  E0  <  c4  ff  (d^Ui)2  dr  dt  +  c4  LL  {dvuff  dr  dt 
with  a  suitable  constant  c4. 

Remark.  In  the  special  case  where  Q  is  a  ball,  theorem  1.1  was  proved  earlier  in  [7], 
[8]  by  a  direct  computation  (leading  to  explicit  constants).  The  proof  given  below  is 
different:  it  is  based  on  a  compactness-uniqueness  method  introduced  in  [13].  While 
it  is  indirect  and  so  the  constants  are  not  explicit,  it  works  for  all  sufficiently  regular 
bounded  domains. 

Applying  the  duality  method  from  [4]  or  the  Hilbert  Uniqueness  Method  from 
[11],  [12]  one  can  deduce  from  theorem  1.1  an  exact  boundary  controllability  result 
concerning  the  system 


'y'l  -  A +  ay2  =  0  in  (0,  T)  x  Q, 
y2  +  A 2y2  +  byl=0  in  (0,  T)  x  U, 
yi  =  Vi  on  (0,T)  x  r, 
y2  =  0  and  A y2  =  v2  on  (0,  T)  x  T, 
l/i  (0)  =  2/10  and  y[(  0)  =  yn  in  Q, 

.2/2(0)  =  y20  and  y'2{  0)  =  y2 1  in  Q. 

Let  us  introduce  for  brevity  the  Hilbert  space 

h  =  L2{n)  x  h~\ n)  x  Hl{n)  x 

Theorem  1.2.  Fix  positive  numbers  0  <T\  <T2  <T  such  that  T  >  T0,  and  let  T 1 ,  T2 
be  as  above.  For  almost  all  pairs  ( a,b )  €  R2  the  problem  (1.6)  is  exactly  controllable 
in  the  following  sense:  Given 

(2/iOj  2/1112/20, 2/21)  6  H  and  (zio,  zn,  z2o,  ^21)  G  H 

arbitrarily,  there  exist  control  functions  v\,v2  6  L2(0,  T;  L2(T))  such  thatvi  vanishes 
outside  o/(0,  T)  x  Ti,  v2  vanishes  outside  of  (Ti,T2)  x  T2,  and  the  solution  of  (1.6) 
satisfies 


2/i (T)  =  zio,  y[(T)  =  zn  and  y2(T)  =  z20 ,  y2{T)  =  z21  in  Q  . 
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Next,  applying  another  general  method  developed  in  [6],  one  can  deduce  from 
theorem  1.1  a  uniform  boundary  stabilization  result  concerning  the  system 

'y'{  -  Aj/i  +  ay2  =  0  in  (0,  oo)  x  Cl, 
y”  +  A2y2  +  by!=0  in  (0,  oo)  x  Q, 
yi  =  v  i  on  (0,oo)xr, 

'  7/2  =  0  and  A y2  =  v2  on  (0,  oo)  x  T, 

2/i(0)  =  2/io  and  y[{  0)  =  yn  in  O, 

.2/2(0)  =  2/20  and  2/2(0)  =  2/21  in  Cl. 

Theorem  1.3.  Fix  an  (arbitrarily  large)  positive  number  co,  and  let  Ti,  T2  be  as  above. 
For  almost  all  pairs  (a,  b)  e  M2  there  exist  two  bounded  linear  maps 

P:H^L2(Cl)  and  Q  :  H  -» 

and  a  positive  constant  M  such  that,  setting 

Vi  =  dl/P(yi,y'1,y2,y'2)  and  v2  =  dvQ(y\, 2/i, 2/2, 3/2) 

the  problem  (1.7)  is  well-posed  for  all 

(2/io,2/ii)2/2o, 2/21)  £  H, 

and  its  solutions  satisfy  the  estimates 

Il(l/i,t/i,lfe.l4)(*)llff  <  Me-Ut\\(yw,yu, 2/20, V2i)\\h 

for  all  t>  0. 

Since  both  theorems  1.2  and  1.3  can  be  obtained  from  theorem  1.1  in  a  standard 
way,  we  shall  only  prove  theorem  1.1  below. 

2.  Proof  of  Theorem  1.1 

First  we  prove  the  inequality  (1.4).  Applying  (1.2)  with  /1  =  —bu2  and  f2  =  —aui 
we  obtain  the  estimate 

ll^Ul|li2(7;L2(r))  +  ll^U2||i2(J;i2(r))  <  c(E0  +  |M|£l(/;£2(n))  +  l(fi)))- 

By  the  well-posedness  of  the  problem  (1.1)  we  have 

Ilui|l!1(j;fl2(n))  +  tlu2||ii(j;Hi(n))  <  cE0 

and  therefore 

ll^Wl|ll2(/;i2(r))  +  ll^U2||L2(/;L2(r))  <  cE0, 


as  stated. 
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Turning  to  the  proof  of  the  inequality  (1.5),  let  us  write  the  solution  u  of  (1.1)  as 
u  =  v  +  w  where  v  and  w  are  solutions  of  the  following  two  problems: 

v "  —  Avi  =  —bu2  in  R  x  Cl, 
v'j  +  A2v2  =  —  aui  in  IxQ, 

■ 

v1  =  V2  =  Ai»2  =  0  on  lx  f, 
r>i(0)  =  v[(0)  =  «2(0)  =  w2(0)  =  0  in  Cl, 

w"  —  Awi  =  0  in  R  x  fi, 
w2  +  A2w2  =  0  in  KxQ, 

-  Wi  =  W2  =  Aw2  =  0  on  R  x  T, 
w1(0)=«io  and  w'1(0)=uu  in  Cl, 
w2(0)  =  u2Q  and  ?n2(0)  =  u2\  in  Cl. 

Applying  the  estimates  (1.3)  to  the  uncoupled  system  (2.2)  we  obtain  the  inequality 
Eq  <  c||5t/Wl||x,2(71.i2(ri))  +  c\\dvW2\\2il2(i2.L2(T2y>, 

whence 

(2-3)  Eo  <  c||3^Ui||^2(7i;i2(ri))  +  c 1 1 3vu2 1 1 1,2 (/2 .£,2 (r2) )  +  cR 

with 

R  —  i|9i/Vl||£2(/i;£,2(ri))  +  ||5l/t;2||£2(/2.£,2(r2)). 

Next  applying  the  estimates  (1.2)  to  the  system  (2.1)  with  a  bounded  interval  I 
containing  0  ,  I\  and  /2,  we  obtain  the  inequality 

R  <  CllU2|||l(/;£2(n))  +  c[|Ul|||l(/;flr-l(n)). 

Using  the  well-posedness  of  problem  (1.1)  and  the  compactness  of  the  embeddings 
Hq(CI)  C  L2(C2)  and  Hq(CI)  C  H _1(fl),  this  implies  that  R  is  compact  with  respect 
to  Eo . 

Due  to  this  compactness  we  can  apply  a  method  developed  in  [13]  for  the  proof  of 
the  inverse  inequality  (1.5).  Using  (2.3)  first  we  reduce  our  problem  to  the  following 
uniqueness  property:  if  a  solution  w  of  (2.2)  satisfies  duw\  =  0  on  Id  xR  and  dvw2  =  0 
on  r2  x  /2,  then  in  fact  all  initial  data  vanish  and  therefore  w±,  w2  vanish  identically. 
Next,  proceeding  always  as  in  [13],  we  reduce  this  to  the  following  simpler  uniqueness 
property:  if  for  some  functions  w\ ,  w2  €  Hq  (Cl)  and  for  some  complex  number  A  we 

{—Awi  +  bw2  =  \w\  in  D, 

A2w2  +  awi  =  Aw2  in  Cl, 

Wi  =  w2  =  A  w2  —  0  on  F 

(dvwi  =0  on  Ti, 
dvw2  =0  on  r2, 

then  in  fact  w\  and  w2  vanish  identically  in  Cl. 


have 

(2.4) 

and  if 

(2.5) 


(2.1) 

and 

(2.2) 
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Now  we  shall  prove  that  this  last  uniqueness  property  holds  for  almost  all  choices 
of  the  pair  (a,  b).  Let  us  fix  an  orthonormal  basis  zi,z2,  .  .  .  in  L2(Q),  consisting  of 
eigenfunctions  of  —A  in  Hq(Q): 

-A  zn  =  7 nzn  in  Q, 
zn  =  0  on  T, 

0  <  7i  <  72  <  •  •  •  , 

7 n  — ►  oo. 

For  every  pair  of  integers  ( k ,  l )  with  7*  ^  7 1,  set 

X  _  7fc  ~  7? 

Kl  + 

and  consider  the  countable  set 

C  ■  fc  =  l,2,...}U{(7fc-AM)(7fc-^,i)  : 

Since  the  set  {(a,  b)  G  M2  :  ab  e  C}  is  the  countable  union  of  real  analytic  curves 
in  R2,  we  have  ab  £  C  for  almost  all  pairs  (0,6)  €  R2.  Henceforth  we  assume  that 
ab  £  C  and  we  shall  prove  the  above  mentioned  uniqueness  property. 

First  we  show  that  every  eigenvector  W  =  {wi,w2)  of  (2.4)  has  the  form  W  =  (3z 
for  some  (3  €  C2  and  for  some  nonzero  eigenfunction  of  -  A  in  H^(Cl): 


(2.6) 


— Az  =  7  z  in  Cl, 
z  =  0  on  T. 


For  this  first  we  seek  eigenvectors  of  the  form  W  =  (3zk  for  every  fixed  k  >  1. 
Substituting  into  (2.4)  we  obtain  for  (3  =  (/?i,/?2)  the  linear  system 


f  (7 k  —  A)/?i  +  6/?2  —  0, 

\oA  +  ill  ~  A)&  =  0. 

Since  by  our  assumption  ab  ^  —(7 1  —  7 *)2  its  determinant  has  two  different  roots 
A,  it  follows  that  the  problem  (2.4)  has  two  linearly  independent  eigenvectors  of  the 
form  W2fc- 1  =  Pik-i Zk  and  W2k  =  (32kZk- 

Denoting  by  Z  the  linear  hull  of  z\,  z2, . . . ,  it  follows  that  W\,  W2,  ...  span  Z  x  Z, 
which  is  dense  in  L2(Cl)  x  L2(Cl).  We  have  thus  found  a  complete  linearly  independent 
sequence  of  eigenvectors  of  the  problem  (2.4).  It  can  be  shown  that  this  sequence  is 
in  fact  a  Riesz  basis,  see  [9] . 

Let  us  denote  by  Aj,  A2,  ...  the  eigenvalues  associated  with  W\,  W2,  ....  If 
they  are  pairwise  distinct,  then  the  problem  (2.4)  has  no  other  eigenvectors  than  the 
multiples  of  the  vectors  Wk-  If  some  of  these  eigenvalues  coincide,  then  all  linear 
combinations  of  the  corresponding  H4s  are  also  eigenvectors  of  (2.4)  with  the  same 
eigenvalue.  Now  it  follows  from  our  assumptions  on  ab  that  Xk  =  A 1  implies  7*  =  7 1, 
and  therefore  every  eigenvector  W  =  (u>i,w2)  of  (2.4)  has  the  form  W  —  j3z  for 
some  (3  e  C2  and  for  some  nonzero  function  satisfying  (2.6).  Indeed,  if  7*,  ^  7 j  but 
\k  =  Xt  =:  A,  then  A  solves  both  characteristic  equations 

(7*  -  A) (7!  —  A)  -  ab  =  0  and  (71  -  X)(-y?  -  X)  -  ab  =  0. 
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Hence  an  easy  computation  gives  that  A  =  A;y,  and  therefore  ab  €  C,  contrary  to 
our  assumptions. 

We  have  thus  shown  that  under  the  assumption  ab  $  C  every  eigenvector  W  = 
(u>i,W2)  of  (2.4)  has  the  form  {faz,faz)  for  some  complex  numbers  fa,  fa  and  for 
some  nonzero  function  satisfying  (2.6). 

Now  using  (2.5)  and  applying  Carleman’s  uniqueness  theorem  we  conclude  that 
fa z  =  faz  =  0  in  £2,  i.e.  W  =  0  in  Cl,  as  stated. 

Remark.  Note  that  the  proof  of  the  theorem  gives  more  than  stated:  the  set  of 
exceptional  matrixes  (a^)  is  not  only  of  measure  zero,  but  a  union  of  countably 
many  real  analytic  curves. 
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On  Dynamic  Domain  Decomposition  of  Controlled 
Networks  of  Elastic  Strings  and  Joint-Masses 
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Abstract.  We  consider  a  planar  graph  representative  of  the  reference  configuration  of  a 
network  of  elastic  prestretched  strings  coupled  at  the  vertices  of  that  graph.  Some  or  all 
of  the  vertices  may  carry  a  point  mass,  and  at  those  nodes  dry  friction  on  the  plane  may 
occur.  We  briefly  describe  the  model  and  some  results  on  well-posedness  and  control  of  such 
systems  obtained  in  the  literature.  We  then  introduce  a  dynamic  domain  decomposition 
based  on  a  Steklov-Poincare-type  operator.  The  analysis  is  given  for  the  time-domain  and 
the  frequency-domain.  Optimal  control  and  problems  of  exact  controllability  are  formulated 
and  investigated  in  terms  of  the  decoupling  procedure. 

1991  Mathematics  Subject  Classification.  93C20,  93C80,  93B05,  65N55 

Key  words  and  phrases.  Dynamic  domain  decomposition,  strings,  networks,  joint  masses, 
dry  friction,  Steklov-Poincare-operators  for  networks,  differential-delay  systems,  optimal 
control,  controllability. 


1.  Introduction 

We  consider  a  network  of  dynamic  elastic  strings  as  in  [10],  [9],  [13].  Let  G  = 
(V,E)  be  a  planar  connected  graph  with  vertices  E,  j)E  =  nv  and  edges  E,$E  =  ne. 
Each  edge  is  representative  of  a  (possibly  prestrechted)  linear  string  in  its  reference 
configuration.  We  label  nodes  Vj  by  capital  letters  and  edges  by  lower  case  letters. 
At  a  typical  node  Vj  we  have  d(vj)  incident  edges,  the  indices  of  which  we  label 
i  G  £j.  To  each  i  £  £j  we  assign  e,j  :=  —1  if  the  edge  i  starts  at  Vj  and  e,j  :=  1 
else.  We  denote  by  xzj  the  number  lz  if  e,j  =  1,  0  if  e,j  =  —1.  We  introduce 

o  o 

V'=  {v  G  V\d(v)  >  1},  dV  =  V\  V  as  multiple  nodes  and  simple  nodes,  respectively. 
Let 

ri(x, t )  :=  Ui(x, t)ei  +  Wi{x, t)ef 

denote  the  deformation  of  the  Tth  string,  where  Ui ,  wz,  e*,  ef  denote  the  longitudinal, 
vertical  displacement,  and  the  unit  vectors  along  the  undeflected  centerline  and  its 
orthogonal  complement,  respectively. 

We  signify  nodes  where  Dirichlet  conditions  hold  by  Vp;  for  simplicity  of  notation 
we  impose  Vp  C  dV .  Correspondingly,  Neumann-nodes  are  denoted  by  Vy  C  dV. 
Let  Kz  denote  the  local  stiffness  matrix.  Then  we  have  the  systems  of  equations 
governing  the  motion  of  the  entire  network. 

(1.1)  h  =  Kir",  i  =  l:ne,Xi£  (0,4),  t  G  (0,T) 

(1.2)  Xd{xdD)  =  0,  V£>  G  Vd,  d  G  £d,  t  G  (0,T) 
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(1.3)  ri(xjj)  =  r j(xjj),  Mi,]  G  £j,vj  GP 

(1.4)  euKy^xu)  =  - mjfiixu )  +  fj,  vj  GV  UPW,  t  G  (0,  T) 

iE£j 

(1.5)  Vi(Xi)  0)  —  ^  (Qih'ji  ^  ^  ■ 

While  (1.1)  is  the  obvious  wave  equation  for  the  i-th  string  with  homogeneous  Dirich- 
let  conditions  (1.2),  equation  (1.3)  gives  the  continuity  of  displacements  accross  the 
joint  vj.  Equation  (1.4)  is  the  balance  of  forces  at  the  joint  Vj  including  the  possi¬ 
bility  of  an  additional  mass  mj  and  an  external  force  fj  there.  Position  (1.5)  finally 
represents  the  initial  conditions. 

System  (1.1)— (1.5)  has  been  shown  to  be  wellposed  in 
ne  ne 

H  :=  f[L2(0,li),  V  :=  {r  G  0,k)\r  satisfies  (1.2),  (1.3)} 

i=l  i= 1 

with  the  typical  setup  V  CC  H  CC  V*,  and  mj  =  0  Vj.  We  note  that  all  spaces 
1/2)  H 1  etc.  appearing  in  this  paper  are  to  be  considered  as  spaces  of  functions  into 
the  plane. 

Theorem  1.1.  [9]  Let  mj  =  0  V  J,  then 

Tly 

M(ro,n)  GVxH,  fe  HL2(0,T)3\r: 

j= i 

r  G  C(0,  T,  V)  n  C\0,  T,  H)  n  C2{ 0,  T,  V")  , 
r  satisfying  (l.l)-(l.b)  in  a  natural  weak  sense. 

Remark  1.2.  More  can  be  said  in  terms  of  regularity,  when  masses  at  joints  are 
present.  See  Section  3  below.  In  case  of  nonhomogeneous  Dirichlet-data,  wellposed- 
ness  for  the  system  with  masses  mj  ^  0  has  been  obtained  in  [15]. 

Remark  1.3.  Problems  of  exact /approximate  controllability  have  been  discussed  in 
detail  in  [9]  where  also  3-d-networks  have  been  considered.  See  also  [9],  [10]. 

Remark  1.4.  The  balance  of  forces  at  multiple  nodes  (1.4}  can  be  extended  in  various 
ways,  in  order  to  account  for  nonlinear  phenomena  as  e.g.  dry  friction  on  the  plane 
or  elastic  and  rigid  obstacles,  rigid  bars  between  joints  ect. .  We  do  not  have  sufficient 
space  to  discuss  these  models  here  in  detail.  To  illustrate  the  possibility  of  treating 
such  phenomena  in  the  context  of  the  subsequent  domain  decomposition,  we  provide 
a  model  of  dry  friction. 

(1.6)  euKiri(xiJ )  +knjfi{xij)  G  - pjd\fi(xu)\ ,  vj  GV  l)VN,  t  G  (0,T). 

it£j 

In  a  more  refined  model  we  have 

(1.7)  eijKiriixij)  =■  Fj  =  -mjriixij)  -  p.jp j, 

i€£j 


pj(t)  = 


fi{XiJ ) 


j)  r 


if  fifaj)  ±  0 


where 
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Pj(t)=Z^§rr  ae  [-§>§]>  ifn(*ij)  =  0,  Fj(t)?  0 
|Fj(t)|  Mj  Mj 

/>j(i)  =  0  else, 

and  fJj,  fisj  are  adhesive  and  shear  frictional  moduli.  See  Panagiotopoulos  [12]  for 
dry  friction  models  and  results  on  well-posedness  for  single-element  systems.  The 
subsequent  analysis  will  show  that  it  is  possible  to  reduce  the  discontinuous  part  of 
the  PDE-problem  above  to  a  finite  dimensional  one. 


2.  A  dynamic  Steklov-Poincare- Operator  for  networks 

We  first  consider  the  local  Dirichlet  problems  for  each  individual  edge  along  e* 

(2.1)  (p,  =  Kip'l,  (0,  If)  x  (0,  T) 

(2.2)  ipfxij )  =  A j  +  Pi^li  Xij)  A m  T  r ioili  '  %ij) >  (0?-O 

(2.3)  <pi{x,  0)  =  ri0(x),ipi(x,  0)  =  ru(x),  (0,  If)  . 

Obviously,  tpi  can  be  written  as  a  superposition 

(2.4)  Vi(x,t,ri0,rii,\j,\M)  =  ifi(x,t,0,0, Xj,  XM)  +  7?i(a:,t,AoT*i,0,0)  . 

Lemma  2.1.  Let  (r0,ri)  €  V  x  H,  Xj,Xm  G  Hq_(0,T)  (i.e.  Xj,Xm  G  {/f!(0 ,T)| 
A(0)  —  0).  Then  for  each  i  there  exists  a  unique  weak  solution  ifi  to  (2.1  )-(2A) 
satisfying 

ifi  €  C(0,  T,  H\ 0,  h ))  n  C\ 0,  T,  L\ 0,  k)) . 


Proof.  The  proof  is  standard.  Nevertheless,  we  need  a  precise  representation  any¬ 
way.  In  particular,  we  will  use  ifi(x,t,0,0,Xj,XM)  below.  r]i(x,t,rio,rn,0,0)  can  be 
constructed,  say,  by  a  d’Alembert-Ansatz.  We  focus  on  ifi,  and  use  a  Fourier- Ansatz. 

Ki  =  ki((  1  -  —)I  +  —eiej')  , 

V  Si  Si  ) 

where  >  1  represents  the  amount  of  stretching.  We  have  ipi=:ip\ei  +  iffeX,  hence 
(2.1)-(2.4)  decouples  into 

(2.5)  ft  =  my,  ft  =  him" 

(2.6)  ipli(xij)  =  Aj,  Viixij)  =  Xvj 

(2.7 )  -  xu)  =  XlM,  Viik-xu)  =  XVM  , 

plus  zero  initial  conditions,  where  p f  =  k,,  qf  =  k,(  1  —  l/sf).  Hence,  solving  the 
in-plane  problem  (2.1)-(2.4)  comes  down  to  solving  scalar  problems  (2.5)-(2.7).  We 
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express  V©  iff  as  follows 

$  =  4 + vl  $1  =  zi  +  yvi 

=  7"(^i  %iJ  “1“  7 ~(%iJ  (0 

H  H 

z-(x,  t)  =  j{li  -  Xu  +  eux)Xvj(t)  +  j{xu  -  eux)XvM(t)  . 

(2.8)  y\  =  Pi(y\)"-'4>  $  =  QKyvi)"  -  % 

(2.9)  y\{xu)  =  0  =  y\{k  -  xu)  ,  »?(®u)  =  0  =  y-(h  ~  xu) 

+  zero  initial  conditions. 

By  standard  calculations  we  obtain,  with  pL  := 

(2.10) y|(x,  t)  -gji -  ♦)  {-  (0'^icos  (f  x„) 

+  ©1,2ei4cos(f  (',“iu))x'*'(s)}*  •  © 7  “"¥*  ■ 

If  we  integrate  (2.10)  by  parts  and  approximate  Hq_(0,  T)  by  Hq_(0,T),  the  asymp¬ 
totic  behaviour  of  the  Fourier  coefficients  implies  that  ip\  have  the  properties  required. 
The  same  obviously  holds  for  ip”.  □ 

Lemma  2.2.  Let  (pi  i  =  1, . . .  ,ne  be  as  in  Lemma  2.1.  Then  y>i(x,  t )  =  r»(a;,  t),  where 
Tii  =  1, . . .  ,ne  solve  (l.l)-(l.h)  if  and  only  if 

(2.11)  £  euKiriixu)  +  mjXj  =  fj  V  J  GV  UVN  . 

ie£j 


Proof.  If  ipi  =  Vi  then  (2.11)  is  obvious.  Conversely,  since  A j(t)  —  (pi{xu,t ),  condi¬ 
tion  (2.11)  is  just  (1.4)  with  (pi .  By  construction 

<fii(xu)  =  ri0{xij)  +  X  j  =  rk0(xkJ )  +  A  j  —  pk(xkJ)  , 

since  r0  G  V.  For  Dirichlet  nodes  vp  we  have  Xp,  =  0.  The  argument  is  applied  to 
strong  solutions  and  then  to  mild  solutions.  □ 

We  return  to  (2.10)  and,  consequently,  require  A  G  H$_  for  the  time  being.  We 
obtain 


(ylY(xiJ,t)  = 


■  euy  E  —  /  sin(/4-)1/2(t  -  s)  {4(©  -  (-l)jXM(s)}  ds 

H  j=i  Pi^J  •'O 

O  OO  7  pi 

■  eur-  E  -  /  M^)l/2(s)  {  Xlj(t  -s)-  (-l)jXlM(t  -  «)}  ds 

kPi  j— i  it 3  ■'O 

•  tiJ  j  .  2^  /,  Gij{s)ds  , 

HPi  fc=0  k=0  ^kli/pi 
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where  mt  =  argminfc{(fc  +  1  )k/pi  >  t}  and  the  integrand  is  put  to  zero  for  s  G 
[t,  (mt  +  1  )k/Pi).  Hence, 


(2.12) 

Now, 


0  00  1.  mt  rii/Pi 

(y\Y(xiJ,  t)  =  euj-  GdT  +  kli/pi)dT  . 

HPi  j=i  71 J  fc_Q  JO 


sin 


Pi K 


4  V  +  t«))  ’ T  +  k”J)  =  G1)W““(xr) 

and  hence  (2.12)  gives 

(2.i3)  am*,..)  -  JT-'* 

Pi  j= 1  fc=0  u 


sin  I  — - — r 


Furthermore 


•  jAj(t  -  r  -  Wj/ft)  -  (-l)JA'M(t  -  r  -  fc/i/ft)}  dr  . 

k  JO  V  li  J  TTJ  Pi 

irno-tT- 


t  dt 


Pi  fli/pi  (,  Pi 

l, 

and  hence  (2.13)  can  be  written  as 

1  mt  rh/pi  00  [  /2»-  \ 

-  s*  Si  s  (t)  ■ 

,  cn  rnn  J 

f2  ft  \ 1/2 


k  even 

■  A lj(t  -r-k li/pi)  + 

(?)T?-(f')*(  f) 

•  -  r  -  fc/i/pi)  j  dr 

e 

Pi  J  o  \  U  j  J  0 

-  .1  .1  J 


.  Pi*] 
sin  — - — r  • 


1/2 


sm 


ftTTJ 


2pa>'!  r‘-/’'P‘tsiu(Mlt]  dt 

li  \  li 


k  odd 

'2ftN 1/2 

l 


)  sin  'ji-i Yj  ■  A lj(t  -t  -k h/pi) 


A  2 Pi  A 1/2  fh/K  /  ft  .A  -  /'ft7rl.A  „  /2ft  A 1/2  .  -  .  /ft^J  A 
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1  K/21 

=  — eiJ  £  {A lj(t  ~  2(&  +  1  )U/Pi)  +  Xj(t  —  Zj/pj) 

Pi  k= 0 

~kPi/h(Xlj{t  ~~  2(fc  +  l)Zj/pj)  —  Aj(t  —  2kli/pi))  } 

i  r(mt-i)/2i 

--ey  £  { \lM(t-(2k  +  3)li/pi  +  \lM(t-(2k  +  l)k/Pl ) 

Pi  k=0 

+Pi/h^lM(t  ~  (2k  +  3 )U/pt)  -  XlM{t  ~  (2 k  +  1  )li/Pi))  } 


1  r("»t-2)/2i  i 

(rf)'(XiJ,t)  =  -eu{X‘j(t)  +  2  £  XlJ(t-2(k  +  l)li/pi}-TeiJXlJ(t ) 

Pi  k= 0 

i  r  r(»H-i)/2i .  i  i 

- <  2  £  A^(t  —  (2k  +  l)k/pi)  >  +  jtijXlM(t)  . 

Pi  {  fc= o  j  li 

Now,  (^)'(*y,i)  =  e^A^t)  -  ~XlM(t),  and  hence 


1  7Tle 

t)  =6y-  {  ^(i)  +  2  £  A lj(t  -  2 (k  +  l)k/Pi) 

(214)  \ 

-2'£X‘M(t-(2k+l)li/pi)}  , 

k= 0 

with  me,  m0,  denoting  the  even  and  odd  boundary  for  the  index  k  obtained  from  the 
previous  formula. 

A  similar  expression  holds  for  the  vertical  component  with  l  replaced  with  v.  With 
the  definitions  P, A  :=  p^ei+q^ej-,  ( DJX)(t )  :=  piXl(t-Tli/pi)ei+qiXv(t-Tli/qi)eP 
we  can  write  (1.4)  with  ipt  replacing  rt  as 

(\  me 

£  Pi )  Aj(t)  +  2  £  £  (Df  k+1) Xj)(t) 

ie£j  J  k— 0  ie£j 

mo 

-2  £  £(£>f+1A iM){t)  =  fj{t )  -  £  euKiTiKxu ,t)  . 

k=0  i££j  i^£j 

Here  vtM  is  adjacent  to  vj  and  belongs  to  the  incident  edge  i.  Note  that  (2.15) 
also  applies  to  a  simple  Neumann-node  vj  where  d(vj)  —  1.  We  may  define  A  := 
(A j,  Ajv,  . . .  )T  and 


5(A)  :=  I  £  euKiipi  {xu,  t,  0, 0,  A  j,  Xm)  +  mjXj(t) 

\ie£j 


D(S)  =  {A  e  n  ^o-(0,  r)|Aj  6  Hl_{ 0,  T)  for  mj  +  0}  . 

j 


(2.15) 
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Then  S  can  be  interpreted  as  a  dynamic  Steklov- Poincare- Operator  for  the  network. 
Indeed,  for  time  independent  problems,  or  those  obtained  by  some  integral  trans¬ 
formation,  S  is  identical  with  a  Steklov-Poincare  operator.  However,  even  in  the 
static  case,  such  operators,  in  the  case  of  networks  of  any  dimension,  do  not  seem 
to  have  received  attention  yet.  The  case  of  coupled  membranes  is  the  subject  of 
current  research.  See  Lagnese  [8] ,  where  a  domain  decomposition  is  used  implicitly. 
Its  importance  is  due  to  the  following  theorem 

Theorem  2.3.  Given  (r0,r1)  €  V  x  H,  f  G  Hj  L2(0,T).  If  A  solves  S( A)  =  F  with 

F  =  \fj~Y,  tijKiV'iixu,  t,  ri0,  ra,  0, 0) 

V  ie£j 

then  the  corresponding  local  solutions  cpt  of  Lemma  2.1  constitute  a  solution  of  (1.1  )- 
(1-5)  according  to  Lemma  2.2. 

The  entire  calculus  leading  to  (2.15)  was  done  to  give  an  explicit  representation  of 
S  defined  by  (2.15). 

Therefore,  in  order  to  solve  problem  (1.1)— (1.5),  we  have  to  solve  the  differential- 
delay  system  of  second  order  in  time  (2.15)  together  with  trivial  initial  histories 
Aj(-s)  =  0  V  s  >  0  VJ  £  k.  Once  that  is  done  step  by  step,  according  to  the 
individual  delays  k/pu  U/qi,  the  solutions  (pi  can  be  computed  in  parallel.  Also,  each 
individual  problem  can  be  solved  in  parallel  using  waveform  relaxation  techniques, 
see  Burrage  [1] . 

Remark  2.4.  If  dry  friction  occurs  at  a  joint  Vj  as  explained  in  Remark  1.4,  one  has 
to  either  add  a  maximal  monotone  operator  to  S,  or  use  the  notation  of  variational 
inequalities  in  order  to  extend  existence  and  uniqueness  to  the  system  with  dry  fric¬ 
tion.  In  particular,  using  the  method  of  steps,  one  obtains  a  sequence  of  ordinary 
differential  equations  with  discontinuous  forcing  term.  The  latter  systems  can  then 
be  treated  step-by-step  in  the  framework  of  DAE’s  (differential- algebraic  equations)  of 
index  2.  We  have  implemented  this  strategy  using  RADA  U5  of  Hairer  and  Wanner 
(see  [4]  as  a  base  reference)  and  projection  methods.  We  have  applied  this  also  to 
beam  networks  with  rotational  dry  friction  between  “pin-joints” .  We  refer  also  to  the 
work  of  Glowinski  et.al.  [3]  (for  matrix  problems),  where  the  time  is  first  discretized  in 
order  to  obtain  an  “elliptic”  problem  together  with  a  projection  accounting  for  the  dry 
friction.  To  apply  their  approach  to  elastic  networks,  using  the  domain  decomposition 
above,  is  the  subject  of  current  research. 

3.  Wellposedness  and  regularity  of  solutions 

It  is  well-known  that  the  solution  Pi(x,t,riO,rn,0,  0)  of  (2.1)-(2.3)  has  the  regular¬ 
ity 

Vi  €  <7(0,  T,  H\ 0,  If))  n  C'1(0,  T,  L2( 0,  If)  . 
for  r0i  G  H1( 0,  If),  ru  G  L2( 0,  If).  In  fact,  one  verifies  that 

V'iixu,  -,rm,ra,  0,0)  G  L2(0,T)  . 
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Lemma  3.1.  Given  (r0,ri)  G  V  x  H,  fj  G  L2(0,T)  V  J  £V  UVN,  there  exists  a 
unique  solution  of  (2.15)  with  Aj(-s)  =  0  V  s  >  0,  J  £  V.  The  regularity  of  \j(t) 
depends  on  the  presence  of  a  mass  mj  at  node  vj: 

i)  if  mj  =  0,  then  Xj  £  #*(0 ,  T),  Aj(0)  =  0 

ii)  if  mj  0,  then  A  j  £  H2(0,T ),  Aj(0)  =  Aj(0)  =  0  . 

Tie 

Proof.  The  right  hand  side  of  (2.15)  is  in  L2(0,  T).  Let  a.\  =  min (h/pi,  h/qf),  then 

solve  (2.15)  on  (0,  cci)  uniquely  for  A  j.  Note  that  because  of  A  j(—s)  =  0  V  s  >  0,  the 
sums  in  (2.15)  are  empty.  Now  A j(t)  satisfies  i),  ii)  for  mj  =  0,  mj  ±  0,  respectively. 
We  do  this  for  all  vertices  and  obtain  A  on  (0,  ax).  By  the  same  procedure,  we  obtain 
the  solution  A  on  (0,  T)  if  we  use  the  solutions  obtained  in  the  previous  step  to  update 
the  history  part  in  (2.15).  This  is  the  classical  method  of  steps  for  differential  delay 
systems,  see  [1].  □ 

Corollary  3.2.  It  is  immediate  from  Lemma  3.1  that  waves  passing  a  node  with  nonzero 
mass  mj  or  waves  generated  at  such  a  node  by  applying  forces  fj,  have  one  more  de¬ 
gree  of  regularity  in  the  edge  incident  at  vj . 

Theorem  3.3.  Let  (r0,ri)  £V  x  H,  fj  £  L2(0,T)  V  J  £V  UV^.  Then  there  exists  a 
unique  mild  solution  r  of  (l.l)-(l.h),  satisfying 

r  £  C(0,  T,  V)  n  C1(0,  T,  H)  . 

Proof.  The  first  part  is  obvious  from  Lemma  3.1  and  Theorem  2.3.  The  second  part 
follows  from  Corollary  3.2.  □ 

Remark  3.4.  We  illustrate  the  smoothing- property  of  the  joint-masses  by  an  exem- 
plaric  situation.  There  are  many  interesting  features  concerning  regularity  issues 
when  masses  are  present  at  joints.  We  don’t  have  sufficient  space  to  go  into  details. 
See,  however,  Willi  and  Baker  [16],  for  the  propagation  of  singularities  in  DDEs 
and  [5],  [15]  for  two-span  strings. 

If  ri0,  m  7^  0  on  a  set  I  of  edges  and  ri0  =  riX  =  0  on  E\I  and  mj  =£  0  Vvj  £  I 
such  that  adj(vj)  n  (E\I)  ^  0.  Then  on  E\I  the  restrictions  n  of  r  satisfy  n  £ 
£7(0,  T,  H2( 0,  h))  H  £7^0,  T,  Hl(0,  If)  n  C2( 0,  T,  L2(0,  If). 

Obviously,  Theorem  3.3  extends  Theorem  1.1  to  the  case  with  mj  ±  0. 


4.  A  frequency  domain  representation 


We  switch  to  a  nodal  description  of  (1.1)-(1.5)  as  in  [11].  The  edge  connecting  the 
adjacent  nodes  vj,vm  is  labeled  by  JM  (i.e.  r,  <->  rjM,  £j  Lj  =  {M\ vm  vj} 
etc.).  Denote  kjM  :=  1  /pjm,  kjM  '■=  1/Qjm-  Djm  diag (j>jm,Qjm),  kjM  := 


d\a.g(kjM,kjM),  Rjm 


cos  0jm  sin  Ojm 
-  sin  Ojm  cos  0JM 


where  eJM  =  ( cos0jM,sm0jM)T , 


9jm  being  the  angle  between  e,  and  (1,0)  in  the  reference  frame.  The  notation 
rJM  implies  that  rJM{ 0)  relates  to  nfaj).  For  the  sake  of  convenience,  we  take 
each  edge  twice,  namely  rJM,rMj ,  thereby  we  obtain  a  multi-digraph.  Of  course, 
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tjm{x)  =  rMj{ljM-x),  implying  uJM(x )  =  -uMj{Ijm~x),  wjm{x)  = 
x).  Consider  the  Laplace-transform  of  (1.1)  (1.5): 

roc 

£f(s)  :=  /  e~stf(t)  dt  =  f(s ) 

Jo 

(4.1)  s2fJM{x ,  s)  -  srJM(x ,  0)  -  rJM(x,  0)  =  KJMfjM(x,  s) 

(4.2)  xdm{  0,  s)  =  0,  Me  £d,  D  e  Vd 

(4.3)  fJM{ 0,  s)  =  fJN( 0,  s)  =  X j(s)  VM,N  e£j 

(4.4)  -  Y  KjMf'JM(0,  s)  +  mjs2X(s)  -  smjA(O)  -  tojA(O)  =  fj(s) 

Me£j 

(4.5)  rJM(x,  0)  =  r°JM(x),  r°JM{x,  0)  =  r^^x)  J  EV  UVN  . 

Again  tjm  =  uJM£jm  +  wJMejM,  hence  (4.3),  (4.4)  translate  to 

(4-6)  (  lJ2  )  (0,  s)  =  Rjm-jk  (  lJ^K  )  (0,  s)  =  Rjm\j{s) 

(4.7)  mjs2X(s)  -  Y  R-JmKjM  (  ^ K  )  (0 ,s)  =  Fj(s )  , 

Me£j  \  WjM  J 

with  Fj(s)  :=  fj(s)  +  smjX(0)  +  tojA(0).  Put  aJM  :=  (alJM,a2JM)T ,  dJM  = 
(4m.4m)T.  then 

(4.8)  (  j  (x,  s )  =  eak™x  (aJM(s)  +  DJM~  J*  e-sk™TbJM(s,  r)  dr) 

+e-skJMx  _  Djm^_  £  eskJM TbjM(Sj t)  dr) 

(4-9)  ^  "ibjM  )  =  ajM(s)  +  dJM{s), 

(  1  s)  =  skJM(aJM{s)  —  dJM{s))  =  RjmXj{s)  . 

\  WJM  j 

There  are  at  least  two  possibilities  to  proceed  further.  One  is  to  derive  scattering 
relations  based  on  dJM  (departure)  and  aJM  (arrival)  as  in  [11],  [10].  Based  on 
this  scattering  analysis,  we  have  been  able  to  develop  nonclassical  control  strategies, 
localization  of  energy-fluxes,  spectral  properties  and  much  more.  Here  we  focus  on  the 
relation  between  the  data  and  the  A/s,  as  being  proposed  in  Section  2. 

Therefore,  the  upcoming  analysis  is  suitable  for  parallelization.  We  use  (4.8)  and 
conclude,  with  Gjm(Ijm,s)  :=  DJMFf^M  e~skjMTbJM(s,T)  dr,  after  some  calculus, 

aJM{s )  =  —(I  —  e~2skjMljM)~l  {e~skjMljM RmjXm{s) 

(4.10)  +e~2skjMljM RjMXj(s)  -  (/  +  e-°k’Ml’M)  G(Ijm,  s)} 

dju(s)  =  Rjm^j{s)  —  dJM{s)  . 
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Equations  (4.10)  provide  an  explicit  representation  of  the  complex  amplitudes  of 
arriving  and  departing  waves  in  forms  of  the  traces  \j,\m  at  the  nodes  vj,vm- 

injs2Xj(s)  +  s  |  ^2  R-jmDjmRjm  Xj(s) 

\me£j  ) 

+  2s(  T  Xj(s) 

(4.11)  \Me£j  / 

+  2s  £  R-jmDjm(I  ~  e~2skjMljM)-1e~skjMljM RmjXm(s) 

Me£j 

=  Fj  +  2sJ2  R-JmDjm(I  -e~2skjM)-\l  +  eskjMhM)GJM(ljMs)  . 

Me£j 

Equation  (4.11)  when  relabeled  with  edge-indices  is  just  (2.15)  in  the  frequency  do¬ 
main.  Note,  however,  that  we  cannot  use  such  transform  techniques  when  e.g.  dry 
friction  is  present  at  joints.  As  a  result,  (4.11)  appears  as  an  extension  of  the  classi¬ 
cal  Steklov-Poincare-equation  to  networks,  and  (2.15)  as  its  pull-back  into  the  time 
domain.  See  Benamou  [2]  for  a  2-d-problem. 

5.  Control  problems 

5.1.  Minimization  of  energy  in  substructures.  We  go  back  to  (2.15)  and  integrate 
with  respect  to  time  from  0  to  t. 

(5.1)  mjXj(t)  +  l  J2  f^|  Xj(t)  +  2  £  £  (D?ik+1]Xj)(t) 

\ieSj  J  k= 0  ie£ J 

mo  ft  ft 

-2£  J2(D?+1XiM)(t)  =  /  fj(s)ds-J2  euKi  v'i(xu,s)ds  . 
k=o  ie£j  Jo  ie£j  J0 

Note  that  Aj(0)  =  0  if  mj  ±  0  and  Aj(-s)  ^0  s  >  0.  Similar  equations  hold 
for  each  node  Vj.  Note,  however,  that  the  J-th  equation  is  the  only  one  to  contain 
A j(t),  Xj(t )  at  the  actual  time  t.  It  is  readily  seen  that  one  might  eliminate  the  Aj’s 
corresponding  to  nodes  without  extra  masses  mj  from  (2. 15), ((5.1)).  As  a  result,  one 
can  reduce  the  system  of  equations  (5.1)  for  the  indices  J  to  those  having  mj  ^  0. 
Having  solved  that  latter  system  one  can  then  use  the  recurrence  relations  for  all 
those  variables  A k  with  mx  =  0.  This  amounts  to  saying,  that  we  may  w.l.o.g. 
consider  a  reduced  system  with  mj  7^  0  V  J.  That  reduced  system  can  be  recast,  into 
a  standard  format  as  follows. 

1 

(5.2)  A(t)  =  A0N{t)  +  Y,AjMt  -  hi)  +  Bu(t)  +  9(t) 

3= 1 

A(-t)  =0,  t  >  0 

with  Ai,  B  obvious  from  (5.1).  In  (5.2)  the  structure  of  Bu(t)  is  again  given  by 

B0F(t)  +  j2BjF(t-hj)  =  Bu(t) 

i=i 
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where  Fj(t)  =  /  fj(s)  ds.  As  remarked  earlier,  dry  friction  at  some  joints  will  result 

in  a  discontinuous  “right-hand-side”,  of  a  type  mentioned  in  Remark  2.4.  Further, 
g(t)  is  given  by  the  local  Dirichlet  problems  with  fixed  Dirichlet-data  and  corre¬ 
sponding  initial  conditions.  It  is  important  to  note  that  the  number  l  of  delays 
increases  with  the  process  time.  Therefore,  if  we  want  to  solve  (2.15)((5.2))  on  a 
time  interval  (0,  T),  then  we  first  have  to  compute  l  as  the  largest  integer  such  that 
max(2(l+l)li/pi,  2(l  +  l)li/qi  <T  i  =  1, . . .  ,ne.  Then  on  (0,T)  we  have  a  finite  de¬ 
lay  problem  (5.2).  We  may,  thereby,  restrict  solutions  A  of  (5.2)  with  l  fixed,  obtained 
by  semi-group  theory,  to  (0,  T)  in  order  to  solve  our  problem,  while  it  is  not  feasible 
to  use  these  semi-group-solutions  beyond  T.  Keeping  this  in  mind,  we  can  use  the 
powerful  theory  of  functional  /retarded  differential  equations  and  the  corresponding 
control  theory  as  surveyed  for  instance  in  Kappel  [7] .  On  the  numerical  side,  we  can 
either  use  approximation  methods  as  surveyed  als  in  [7]  or  the  very  recent  parallel 
approach  outlined  in  Burrage  [1].  We  do  not  have  sufficient  space  to  dwell  on  this  any 
further.  The  main  purpose  of  this  section  is  to  reformlate  the  problem  of  minimizing 
the  energy  of  a  specified  subsystem,  in  terms  of  the  nodal  variables  A.  In  the  edge  i 
we  have  the  local  “total”  energy: 

Ei{t)  :=  [\ri(x,t)\2 +  Kir'i(x,t)-r'i{x,t)}  dx 

+^rnj\Xj(t)\2  +  ^rnM\XM(t)\2  :=  Ei0(t)  +  En(t ) 

Using  (2.1)-(2.4)  and  the  results  of  Section  2  we  obtain 
d> 

— ui0(t)  =  tijKi^Xijt)  +  rj'^xu,  t ))  •  A j{t)  +  eiMKiiipfaiM,  t)  +  t ))  •  XM{t) 

=  eijKiipKxuit)  ■  A j(t)  +  CiMKiip'^XiM^)  •  A M(t) 

+  tuKiti^xu,  t)  •  A j{t)  +  eiM Kip'^xm ,  t )  •  A M(t)  . 


However,  by  (2.14)  we  have 


me  mo 

KMxu,  t)  =  eu(PMt)  +  2  £(A2(fc+1)A j)(t)  -  2  J2(D2k+1XiM)(t)), 

k= 0  fc=0 

and  hence,  upon  integration,  it  is  seen  that  Ei(T)  is  a  quadratic  functional  in 
Xj{t),  Am(0  and  their  histories.  In  particular,  set  n  =  2-nv,  M 2  =  Mn  x  L2{—h,  0;  M") 
where  0  =  h0  <  h\  <  . . .  <  hi  =:  h.  Then,  using  the  notation  from  (5.2)  we  can 
define  the  operator  A  in  M 2  by 

D(A)  =  {(/,  01)  G  MV  g  H\-h,  0;  MV  =  ^(0)} 

M 

14'  =  Y.A4H-h,) 

3=0 
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see  Kappel  [7].  Obviously,  with  z(t)  :=  (z°(t),z1(t))T, 
z°(t)  =  0)=0 

3=0 

z\t,s )  =  (^'(M),  ^(t.O)  =  *“(*),  z1(0,s)=0«G[-/i)0] 

=>  z'ft,  s)  =  z°(t  +  s ) 

• 

3=0 

Therefore,  our  system  (5.2)  can  be  written  in  the  standard  format 

f  z(t)  =  Az{t)  +  Bu(t)  +  g(t)  te[0,T] 

\  z( 0)  =  z0 

We  have  thus  shown,  that  solving  the  problem  of  minimizing  the  vibrational  energy 
on  a  substructure  of  an  elastic  network  of  strings  reduces  to  a  finite-horizon  LQR- 
problem  for  a  system  of  delay  differential  equations  of  a  simple  form  with  zero  initial 
history.  As  there  are  no  continuous  delays  involved,  one  can,  in  fact,  always  proceed 
with  a  step-by-step  procedure  involving  the  solution  of  ODEs  only.  It  is  also  possible 
to  account  for  the  unbounded  delay  case  directly.  Then  one  rewrites  the  system  into 
a  system  of  ordinary  Volterra-integrodifferential  equations  in  the  Stieltjes-sense  with 
monotone,  piecewise  constant  kernels.  This  setup  is  more  suitable  when  discussing 
the  longtime  behaviour  of  the  system,  e.g.  when  considering  an  infinite  horizon  LQR- 
prohlem. 

As  a  result,  by  the  decomposition  method  outlined  so  far 

•  we  are  able  to  reduce  an  infinite  dimensional  -  possibly  nonlinear  -  control 
problem  to  a  finite  dimensional  one,  without  any  kind  of  approximation 

•  we  can  solve  the  delay  system  and  its  adjoint  occuring  in  the  optimality  condi¬ 
tions  in  parallel  using  methods  from  [16]  to  obtain  the  optimal  nodal  positions 

•  we  can  solve  the  local  PDE’s  on  each  edge  in  parallel  for  given  nodal  positions. 

The  task  of  putting  this  program  into  numerical  algorithms  is  on  its  way. 

5.2.  Controlling  the  energy  flux.  The  flux  of  engergy  in  an  edge  is  given  by 
-Ky^xA)  ■  h{x)  and  is  equal  to  the  energy  transported  across  the  section  at  x 
in  the  positive  erdirection,  with  respect  to  a  time-unit.  Hence,  at  a  node  vj, 
the  energy  transported  into  the  direction  of  the  incident  edge  is  eijKir'^Xijjt)  ■  A j 
(-KjMr'jM{0,t)  ■  Aj(t)).  Hence,  in  order  to  maximize  (or  minimize)  the  energy  flux, 
say,  along  a  given  forest  with  roots  as  sources  and  sinks  as  leaves,  on  could  take 
squares  of  the  flux  as  a  cost.  The  resulting  problem  would  be  similar  to  those  of 
Section  5.1. 

Let  us  instead  use  the  frequency  domain  approach  of  Section  4.  There,  we  have 
shown  that 

djM(s)  =  -  ajM{s), 

with  aJM{s )  given  by  (4.10),  is  the  complex  amplitude  of  the  wave  running  from 
vj  towards  vM-  The  correspondance  with  the  time-domain  expression  of  the  flux  is 
obvious.  Therefore,  the  problem  of  controlling  the  flux  of  engery  comes  down  to  an 
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equality- constrained  LSQR-problem  or  a  quadratic  programming  problem.  This  ap¬ 
proach  has  been  discussed  in  Benamou  [2]  in  the  context  of  minimizing  the  scattering 
of  waves  incident  at  a  surface  of  an  obstacle. 

Also,  it  is  possible  to  consider  the  problem  of  exact  anihilation  of  waves  -  which  is 
an  analogue  of  the  well-known  anti-sound  problem. 

We  remark  that,  particularily  in  the  context  of  beams,  where  dry  friction  between 
“pin-joints”  becomes  relevant,  one  might  consider  changing  dry  frictional  parameters 
as  variable  structure  controls.  This  is  an  open  field  of  research. 

As  is  amply  demonstrated,  there  is  a  wealth  of  interesting  optimal  control  prob¬ 
lems  which  can  be  reduced  in  dimension  considerably  using  the  proposed  domain- 
decomposition. 

5.3.  Controllability.  Problems  of  exact  controllability  and  approximate  controllabil¬ 
ity  for  non  collinear  2-d-or  3-d-networks  of  strings  and  beams  in  the  absence  of  masses 
mj  have  been  investigated  in  Lagnese,  Leugering,  Schmidt  [9].  The  case  of  such  net¬ 
works  with  additional  joint-masses  mj  remained  open.  Results  in  this  direction  are 
available  only  for  serial  strings  with  such  masses.  See  Schmidt  and  Wei  [14],  Wei  [15] 
and  Hansen,  Zuazua  [5].  As  an  exemplaric  problem  in  both  papers,  a  two-span  string 
system  with  point  mass  in  the  middle  is  considered.  The  comparatively  simpler  anal¬ 
ysis  in  sections  2,  3  makes  it  clear  that,  in  that  examplaric  situation  with  one  extreme 
point  clamped  and  the  other  controlled,  the  waves  originating  from  the  latter  input 
source  will  be  smoothened  by  one  degree  of  regularity  while  passing  the  mass  in  the 
middle.  Therefore,  rough  data  in  the  first  string  (with  one  end  clampded)  cannot  be 
compensated  for  by  control  inputs  at  the  other  extreme.  Indeed,  for  Dirichlet  control 
problems  in  that  context,  it  was  shown  in  [15],  [5]  that  exact  controllability  holds,  if 
more  regularity  is  required  in  the  span  which  is  not  directly  connected  to  a  controlled 
end,  but  rather  reached  by  passing  a  mass.  It  may  appear  remarkable  then,  that 
in  the  Neumann  case  (the  Dirichlet  case  can  be  handled  in  a  similar  fashion)  a  far 
simpler  analysis,  when  compared  with  [15],  [5],  yields  the  same  results,  even  for  non 
collinear  networks!  The  argument  is  quite  simple:  consider  a  two-span  string  system 
with  a  mass  in  the  middle.  For  the  clamped  node  there  is  no  component  in  (2.15) 
and  the  corresponding  A,  say  A at,  is  zero.  Let  vj  be  the  mid-node  with  mass  mj 
and  displacement  A j,  while  the  controlled  node  is  %,  where  a  Neumann  control  is 
applied.  For  the  sake  of  simplicity  and  bevity  we  take  all  constants  equal  to  1. 

Also,  for  simplicity,  let  <pj(t),  )  denote  the  (sum  of)  forces  at  vj,vM  caused 

solely  by  the  initial  data  y~]  ejjKir]'i(xij,  t  in  (2.15);  then  (2.15)  reads  like: 

\i€£  J 

me  _  mp 

+  2A j(t)  +  4  ^2  ~~  2(fc  +  1))  —  2  ^2  —  (2/c  +  1))  =  4>j{i) 

k= 0  fc=0 

me  _  mo 

(5.3)  A^r(i)  +  2  ^  A M(t  —  2 (k  +  1))  —  2  ^  A j(t  -  (2k  +  1))  =  /m(^)  —  <t>M(t) 

k=0  k= 0 

Aj(0)  =  Aj(0)  =  Am(0)  =  0  . 

Now,  given  the  initial  data  for  rq,  r2(l  <-»  NJ,  2  JM)  and  zero  boundary  condition 
at  vN,  we  can  compute  the  solution  r1(x,  1),  jq(x,  1).  In  the  time  interval  [1,3]  we 
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then  solve  the  Dirichlet  boundary  control  problem  in  the  usual  way,  and  obtain  a 
unique  boundary  control  at  vj  on  [1,3].  This  is  the  A j(t)  required  on  [1,3].  Looking 
closer  at  (5.3),  it  appears  that  A j(t)  given  on  [1,3]  uniquely  determines  fM  on  [0,2], 
Then,  the  Dirichlet  data  A j(t)  on  [1, 2]  and  the  initial  data  of  rur2  will  give  a  solution 
r2(x,2),r2(x,2).  We  take  those  as  initial  data,  and  consider  the  Dirichlet  boundary 
condition  given  by  A j(t)  on  [2, 3]  and  0  on  [3, 4].  On  edge  #2  we,  therefore,  have  again 
the  problem  of  exact  null  controllability  on  the  domain  [0, 1]  x  [2, 4].  The  solution  of 
this  controllability  problem  determines  /m(£)  on  [2,4], 

This  principle  can  be  applied  also  to  different  physical  constants,  and  using  the 
arguments  in  [5]  on  p.  1390,  to  varying  stiffness-problems.  Moreover,  and  more 
importantly,  we  can  show  exact  controllability  results  for  tree-like  networks  with 
joint-masses,  when  all  leaves  are  controlled.  It  is  plain  that  the  regularity  of  initial 
data  has  to  increase  by  one  degree,  each  time  a  mass  has  to  be  passed  while  following  a 
path  to  a  controlled  end.  The  precise  regularity  statement  is,  however,  a  bit  involved 
and  admittedly  of  rather  academic  interest  in  real  applications.  Therefore,  if  we  do 
not  insist  on  sharp  regularity  requirements  for  the  initial  data,  we  can  state  the  result 
in  the  following 

Theorem  5.1.  Let  the  initial  data  be  sufficiently  smooth.  Let  G  be  a  tree  with  the  root 
vroot  damped.  Let  all  simple  nodes  (other  than  the  root)  be  controlled  by  L2(0,T)- 
Neumann  controls,  where  T  >2*  dist{vrooUG).  Then  the  correpsonding  network  of 
elastic  strings  (1.1-1. 5)  with  masses  at  the  joints  is  exactly  controllable. 

As  was  shown  in  [11],  exact  controllability  of  a  4-node  star-graph  with  one  node 
clamped  and  only  one  other  simple  node  controlled  holds  for  a  massless  multiple 
joint.  Now  instead  we  allow  for  a  mass  at  the  coupling  node  vj,  and  consider  the 
two  uncontrolled  strings  connected  at  vj  through  a  mass  mj.  Assume  that  one  of 
the  strings  is  clamped  at,  say  v at,  while  the  other  is  free  at,  say  Vm •  The  two  strings 
satisfy,  in  addition  to  appropriately  regular  initial  conditions,  a  continuity  condition 
at  vj,  the  nodal  displacement  being  A j.  The  resulting  subsystem  is  generically  exactly 
controllable  by  an  fJ02_(0,  T)-in-span-control  (for  H2( 0,h)  x  Hl(Q,k)- initial  data). 
Once  that  Dirichlet-control  is  identified  with  A j{t)  on  (0,T),  the  controllability  from 
the  fourth  node,  say  vc,  by  a  Neumann-control  follows  as  in  the  case  above.  Iterating 
this  procedure,  we  can  prove  exact  controllability  of  e.g.  a  serially  connected  (not 
necessarily  collinear  though)  string  with  one  extreme  clamped,  the  other  extreme 
controlled  in  the  Neumann-data,  and  with  further  uncontrolled  strings  attached  to  all 
interior  nodes,  provided  the  boundary  data  of  those  attached  strings  are  of  Neumann- 
type.  If  we  do  not  attach  strings  to  all  interior  nodes,  controllability  (given  identical 
elements)  will  depend  on  how  many  nodes  (an  even  or  odd  number  of  nodes)  are  left 
out  in  a  row.  A  detailed  analysis  goes  beyond  the  scope  of  this  paper  and  would  have 
to  be  related  to  Ho’s  work  [6] . 
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On  a  Weakly  Damped  System  Arising 
in  the  Control  of  Noise 
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Abstract.  We  consider  a  simplified  linear  hybrid  system  for  the  problem  of  the  control 
of  noise  in  a  cavity,  consisting  of  two  coupled  wave  equations  of  dimensions  two  and  one 
respectively.  A  dissipative  term  is  assumed  to  act  in  the  one-dimensional  equation.  We 
prove  the  existence  and  the  uniqueness  of  solutions.  Each  trajectory  is  proved  to  converge 
to  an  equilibrium  as  t  — >  oo.  On  the  other  hand  we  show  that  the  convergence  rate  of  the 
energy  is  not  exponential.  The  proof  of  this  result  uses  a  perturbation  argument  allowing 
to  modify  the  boundary  conditions  so  that  separation  of  variables  applies. 

1991  Mathematics  Subject  Classification.  35B35,  35L05,  35P05,  73K70 

Key  words  and  phrases.  Hyperbolic  system,  stabilization,  decay  rate,  spectral  analysis, 
aeromechanic  structure  interaction. 


1.  Introduction  and  the  mathematical  model 

In  this  paper  we  study  a  simplified  model  for  the  problem  of  the  active  control  of 
noise,  introduced  in  [3],  consisting  of  a  two-dimensional  interior  cavity  with  a  flexible 
boundary.  The  acoustic  vibrations  of  the  fluid  which  fills  the  cavity  are  coupled  with 
the  mechanical  vibrations  of  a  string  with  fixed  ends  (Dirichlet  boundary  conditions) 
located  on  the  boundary  of  the  cavity.  This  constitutes  a  hybrid  system  since  two 
vibrations  of  different  nature  interact.  For  other  examples  of  hybrid  systems,  such  as 
those  coupling  strings  or  beams  with  rigid  bodies,  see  [14]  or  [9] . 

Let  us  describe  in  more  detail  the  mathematical  model  we  shall  study. 

We  consider  the  two-dimensional  cavity  Q  =  (0, 1)  x  (0, 1)  filled  with  an  elastic, 
inviscid,  compressible  fluid,  in  which  the  acoustic  vibrations  are  coupled  with  the 
mechanical  vibration  of  a  string  located  in  the  subset  r0  =  {(x,  0)  :  x  £  (0, 1)}  of  the 
boundary  of  Q. 

To  describe  the  acoustic  wave  motion  let  v  be  the  velocity,  p  the  pressure  and  p 
the  density  of  the  fluid  in  our  domain.  Also,  we  consider  that,  at  rest,  the  pressure 
p0  and  the  density  p0  are  constant.  The  linearized  equations  for  the  propagation  of 
sound  in  an  inviscid,  elastic  and  compressible  fluid,  describing  small  disturbances, 
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are  (see  [11]): 
(1.1) 


p'  +  pa  div  v  =  0  in  fi  x  (0,  oo) 
Pqv'  +  Vp  =  0  in  x  (0,  oo). 


We  denote  by  '  the  time  derivative. 

Let  W  be  the  transversal  displacement  (in  the  plane  of  Q)  of  the  string  which 
is  assumed  to  be  dissipative  and  with  Dirichlet  boundary  conditions.  The  interior 
pressure  p  of  the  fluid  is  acting  on  the  string: 

,10»  j  W" -Wxx  +  W' =  p-p0  on  r0  X  (0, oo) 

\  W( 0)  =  W(  1)  =0  for  t  £  (0,  oo). 

On  r0  we  impose  the  condition  of  continuity  of  the  velocity  fields  which  results  from 
the  assumption  that  the  string  is  impenetrable  to  the  fluid.  The  part  I\  =  dQ  \  T0 
of  the  boundary  of  Q  is  rigid  and  impenetrable,  thus  leading  to  zero  normal  velocity. 
We  obtain  the  following  boundary  conditions: 

(  v  ■  v  =  0  on  Ti  x  (0,  oo) 

(■*-•3)  j  v  ■  v  =  W'  on  r0  x  (0,  oo). 


By  v  we  denote  the  outward  unit  normal  to  the  boundary. 

In  studying  sound  waves,  it  is  usual  to  assume  that  p  =  /(p).  In  the  case  of  small 
perturbations,  we  can  consider  that  the  relation  between  p  and  p  is  linear  (see  [11]): 

(1.4)  P  -  Pa  =  c20(p  -  p0) 

where  c0  is  the  speed  of  sound  in  our  fluid. 

We  obtain  the  following  system  in  v ,  p  and  W : 

■  +  Vp  =  0  in  ft  x  (0,  oo) 

p1  +  divu  =  0  in  flx(0,  oo) 

v  ■  v  —  0  on  Ti  x  (0,  oo ) 

,  ,  v  ■  v  =  W'  on  F0  x  (0,  oo) 

^  ‘  ‘  \  W"  -Wxx  +  W'  -p  =  Q  on  T0  x  (0,  oo) 

W(0,t)  =  W{l,t)  =  0  for  te(  0,oo) 

u(0)  =  v°,  p{  0)  =  p°  in  Cl 

W{  0)  =  W°,  W'{  0)  =  W1  on  T0. 

Observe,  in  particular,  that  the  string  is  fixed  at  its  ends  x  =  0, 1. 

For  simplicity  we  have  modified  slightly  the  model  introduced  in  [3]  considering  on 
the  boundary  an  active  string  instead  of  an  Euler-Bernoulli  beam.  Nevertheless  all 
the  techniques  we  use  here  can  be  easily  adapted  to  that  case  and  similar  results  can 
be  proved. 

We  define  the  energy  associated  with  this  system  by: 

(1.6)  E(t)  =  l  [  (I  w  I2  +p2)  +  \  /  ((wy2  +  (WO2). 

Z  JQ,  Z  J  To 

In  (1.5),  for  simplicity,  we  have  normalized  all  the  constants  to  unity.  If  this  is  not 
done  one  has  to  change  in  a  convenient  way  the  definition  of  the  energy  but  the  main 
results  of  this  paper  remain  valid. 
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The  system  has  a  dissipative  nature.  Indeed,  multiplying  in  (1.5)  the  first  equation 
by  v,  the  second  equation  by  p,  the  fifth  equation  by  W'  and  integrating  by  parts, 
we  get,  formally,  that: 

dE{t)/dt  =  -[  (W')2<  0. 

The  aim  of  this  article  is  to  study  the  effect  of  the  damping  term,  which  is  con¬ 
centrated  in  the  string  equation,  on  the  asymptotic  dynamics  of  the  whole  system. 
We  shall  prove  that  the  dissipation  can  force  the  strong  stabilization  but  it  cannot 
ensure  an  uniform  decay  rate. 

We  remark  that  this  result  is  not  surprising  in  view  of  the  structure  of  the  damping 
region.  Indeed,  as  Bardos,  Lebeau  and  Rauch  prove  in  [6],  in  the  context  of  the  control 
and  stabilization  of  the  wave  equation  in  bounded  domains,  if  one  characteristic  ray 
escapes  to  the  dissipative  region  we  can  not  expect  an  uniform  decay  to  hold  (see 
also  Ralston  [21]).  In  our  case  each  segment  {(a:, a),  x  E  (0, 1)},  0  <  a  <  1,  is  such 
a  ray  and  therefore  the  decay  rate  may  not  be  uniform. 

Nevertheless,  in  our  problem,  the  lack  of  uniform  decay  is  fundamentally  due  to 
the  hybrid  structure  of  the  system.  Indeed,  the  nature  of  the  coupling  between 
the  acoustic  and  elastic  components  of  the  system  (i.e.  the  boundary  conditions 
on  T0)  allows  to  build  solutions  with  arbitrarily  slow  decay  rate  and  with  the  energy 
distributed  in  all  of  the  domain  and  not  only  along  some  particular  ray  of  geometrical 
optics  as  in  [21], 

B.  P.  Rao  in  [22]  has  shown  that,  in  various  one-dimensional  hybrid  systems,  the 
coupling  is  such  that  the  damping  term  is  a  compact  perturbation  of  the  underlying 
conservative  dynamics.  This  kind  of  arguments  does  not  apply  in  our  problem,  since 
we  are  in  space  dimension  two.  Actually,  in  [17],  we  have  proved  that,  in  a  similar 
system,  the  difference  between  the  semigroup  generated  by  the  dissipative  system 
and  the  one  generated  by  the  corresponding  conservative  system  is  not  compact. 

Let  us  mention  that  a  similar  problem,  in  which  Neumann  boundary  conditions  are 
considered  for  the  string,  was  studied  in  detail  in  [16]  and  [17],  Prom  the  mathematical 
point  of  view  this  case  is  easier  since  it  allows  us  to  separate  the  variables  and  to 
obtain  explicit  informations  about  the  eigenvalues  and  eigenfunctions  of  the  system. 
In  this  way  we  have  showed  that  there  exists  a  sequence  of  eigenvalues  approaching 
the  imaginary  axis  at  high  frequencies  and  that  the  corresponding  eigenfunctions 
have  the  property  that  the  energy  concentrated  in  the  string  vanishes  asymptotically. 
This  implies  that,  although  all  solutions  tend  to  zero  when  the  time  goes  to  infinity, 
the  decay  rate  is  not  uniform. 

In  [1]  and  [2]  the  strong  stability  of  the  following  system  is  studied: 


'  -  A<P  =  0 

<P  =  0 

d<&/dv  +  a®  =  W' 

,  W"  +  a2W  +  A2W'  +  =  0 

W  =  dW/dv  =  0 
$>(0)  =  $°,  $'(0)  =  fc1 

W{  0)  =  W°,  W'(0)  =  W1 


in  D  x  (0,  oo ) 
on  7  \  70  x  (0,  oo) 
on  70  x  (0,  oo) 
on  70  x  (0,  oo) 
on  ckyo  x  (0,  oo) 
in  D 
on  7o, 


(1.7) 
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where  D  is  a  bounded  open  subset  of  Mn  with  Lipshitz  boundary  7,  70  is  a  segment 
of  7  and  a  >  0. 

Observe  that,  since  we  are  dealing  with  an  irrotational  fluid,  the  velocity  v  and 
pressure  p  can  be  written  in  terms  of  a  potential  <J>:  v  =  V$  and  p  =  — When 
doing  this,  system  (1.5)  can  be  rewritten  follows: 


<h"  -  A$  =  0 

in 

Q  x  (0, 00) 

d$/dv 

=  0 

on 

Ti  x  (0, 

00) 

d$/dv 

=  W' 

on 

T) 

O 

X 

00) 

W"  -  Wxx  +  W'  +  =  0 

on 

cT 

X 

0 

00) 

wx(  0)  = 

=  wx(  1)  =  0 

for 

t  £  (0,  00) 

*(0)  = 

$0,  $'(0)  =  fc1 

in 

n 

W(  0)  = 

=  W°,  W'(  0)  =  w1 

on 

r0. 

Let  us  point  out  some  of  the  differences  between  systems  (1.7)  and  (1.8).  First  of 
all  observe  that  the  potential  $  is  assumed  to  vanish  on  the  rigid  subset  7  \  70  of 
the  boundary.  This  simplifies  the  set  of  equilibria  of  the  system  that,  in  this  case,  is 
reduced  to  (<t>,  W)  =  (0, 0).  However,  the  condition  <l>  =  0  on  7  \  70  does  not  seem 
to  be  realistic.  On  the  other  hand  the  continuity  condition  on  the  velocity  fields  has 
been  modified.  Indeed,  the  condition 

d$/dv  =  W\ 


has  been  replaced  by: 

d$/dv  +  a&  =  W',  a  >  0. 

These  boundary  conditions  introduce  an  extra  dissipation  on  the  system,  since 

=  |VW'|2  —  a  f  |$'|2. 

dt  J  70  •'70 

Moreover,  the  displacement  W  is  assumed  to  satisfy  a  strongly  damped  plate  equation 
whose  principal  part  W"  +  A2W  +  A2VF/  is  known  to  generate  an  analytic  semigroup. 
In  this  sense,  this  problem  is  different  from  ours.  An  analogous  model  in  which  the 
strongly  damped  plate  equation  is  replaced  by  W"  —  Wxx  —  W'xx  +  $'  =  0  and  a  =  0 
has  been  analyzed  in  [18]. 

In  [1],  taking  a  >  0  and  70  sufficiently  large,  the  exponential  stability  result  is 
proved  by  using  multipier  techniques. 

The  rest  of  the  paper  is  organized  as  follows. 

In  Section  2  we  present  an  abstract  formulation  of  the  problem  and  we  give  a 
result  of  existence,  uniqueness  and  stability  of  solutions.  Since  we  are  dealing  with 
a  linear  system  all  these  results  are  direct  consequences  of  the  classical  theory  of 
maximal-monotone  operators. 

The  asymptotic  properties  of  the  solutions  are  studied  in  Sections  3  and  4. 

In  Section  3  we  prove  the  convergence  of  each  solution  of  the  system  to  an  equi¬ 
librium  point  uniquely  determined  by  the  corresponding  initial  data.  We  do  this 
using  classical  techniques  involving  La  Salle’s  Invariance  Principle  and  Holmgren’s 
Uniqueness  Theorem. 

The  rate  of  the  convergence  to  the  equilibrium  is  studied  in  Section  4.  We  prove 
that  the  decay  rate  is  not  uniform.  In  order  to  do  this  we  start  from  the  observation 
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that  the  same  property  is  true  for  the  system  with  Neumann  boundary  conditions 
for  the  string  and  next  we  use  the  fact  that  the  difference  between  these  two  systems 
is  negligible  at  high  frequencies. 

2.  Existence  and  uniqueness  of  solutions 

We  define  the  space  of  finite  energy  corresponding  to  (1.5)  by: 

X0  =  £  X  L2(Q)  x  H^To)  X  L2(r0), 

C  =  ju  e  L2(Cl)  x  i2(Q)  :  curl  v  =  oj  = 

=  j»-=  (vi.tfc)  €  L!(Si)  x  L!(S!)  :  jf  -  ^v(j  =  0,  V»>  £  C~(!J)}  . 

Remark  1.  Observe  that  v  G  jC  if  and  only  if  there  exists  a  function  $  G  H1(Q.)  such 
that  V$  =  v. 

Xq  with  the  natural  inner  product  is  a  Hilbert  space. 

We  define  in  X0  the  unbounded  operator  (V(A),  A)  in  the  following  way: 

A(v,  p,  W,  V )  =  (Vp,  div  v,  -V,  -Wxx  +  V  -  p), 

V{A)  =  {U  =  {v,p,W,V)  G  Xo  :  A(U)  G  X0,  v-u  =  0  on  Tu  v-v  =  V  on  F0}  . 

Remark  2.  Let  (v,p,W,V)  G  'D(A).  Observe  that  divv  G  L2(Q)  and  v  G  C  imply 
that  there  exists  $  G  H1^)  with  V$  =  v  such  that  A<f>  G  L2(fi).  Since,  in  addition, 
we  have  v  ■  v  =  0  onY\  and  v  ■  v  =  V  on  To  we  obtain  that 

I  A $GL2(Q) 

|  d$/dv  =  0  on  r1;  d$/dv  =  V  G  Hq(F0)  on  F0. 

Since  0  is  convex  it  results  that  $  G  iF2(fi)  (see  [8],  Theorem  5. 1.3. 5,  p.  263).  It 
follows  that  V(A)  C  (Hl(0))2  x  Hl(Q)  x  H2(r0)  D  x  H%( F0)  and  therefore 

V(A)  is  compact  in  X0. 

We  can  consider  now  the  following  abstract  Cauchy  formulation  of  (1.5): 
f  U'  +  AU  =  0 

(2.1)  U(0)  =  U0 

[  U(t)  =  (v,p,W,W')(t)eV(A). 

First,  we  have  a  classical  result  of  existence,  uniqueness  and  stability  for  the  system 

(2.1) .  The  terminology  we  use  is  the  same  as  in  [7]. 

Theorem  2.1.  i)  A  is  a  maximal  monotone  operator  in  X0  generating  a  strongly 
continuous  semigroup  of  contractions,  {5(t)}t>o,  in  X0. 

ii)  Strong  solutions:  If  U°  =  W°,  W1)  G  V(A)  then  there  exists  a  unique 

strong  solution  S(t)U°  =  U  G  C([0,oo),Z>(*4))  flC1([0,oo), Xq)  of  (2.1). 

in)  Weak  solutions:  If  U°  =  (v°,p°,  WQ,  W1)  G  X0  then  there  exists  a  unique 
solution  S{t)U°  =  U  G  C([0,  oo),T0)  of  (2.1). 
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For  any  weak  solution,  the  associated  energy  (l.G)  satisfies: 

‘2-2>  f  = 

Proof.  We  prove  first  that  the  operator  A  is  maximal  monotone  in  X0. 

Indeed,  if  U°  =  {v°,p°,  W°,  W1)  e  V(A)  then  (AU°,  U°)<  -  f  (W1)2  <  0,  which 

•/ro 

means  that  A  is  monotone. 

On  the  other  hand,  for  all  F  =  (A,  A,  A,  A)  €  X0  we  can  find  a  unique  solution 
U  =  (v,p,  W,  V )  e  V(A)  for  the  equation  (A  +  I)U  =  F.  This  is  equivalent  to  solve 
the  following  system: 

Vp  +  v  =  f[ 

,0  o\  divu  +p  =  A>  v  ■  v  =  0  on  Tj  and  v  ■  u  =  V  on  To 

V  +  W^f3 

.  -Wxx  +  V-p  +  V  =  UmdW(0)  =  W(l)  =  0. 


First,  we  consider  the  variational  formulation  of  (2.3),  which  consists  in  finding 
(p,W)  in  H1^)  x  Hq(T0)  such  that,  for  all  (<p,u)  e  H 1(fi)  x  Hq(T0)  : 

f  Vp  ■  V<p  +  f  PF+  [  W<p  +  f  Wxux  -  [  pu  +  2  f  Wu 
,n  ,s  Jci  Jn  J r0  ^r0  Jr0  Jr0 

(2-4)  r  r  f  r 

=  /  /i‘V<p+  /  f2<p+  /  h<P  +  /  (A  +  2/3)11. 
jq  ./fi  Jr0  «/r0 

The  left  side  of  the  equation  (2.4)  defines  a  continuous  and  coercive  bilinear  form 
in  (Ffl{Fl)  x  H\Vq))2  while  the  right  side  defines  a  continuous  linear  form  in  x 

Hl(T0). 

Applying  Lax-Milgram’s  Lemma  it  results  that  (2.4)  has  a  unique  solution  (p,  W) 
in  Hl{h)  x  Ho(T0).  Finally,  in  view  of  the  classical  regularity  results  for  Laplace’s 
operator,  this  implies  that  A  +  I  is  maximal. 

Since  the  operator  A  is  maximal  monotone  in  Xq  we  can  apply  the  Hille-Yosida 
theory  (see  [7],  Theorem  3.1.1,  p.37)  and  obtain  the  stated  results.  □ 


3.  Strong  stabilization 

Concerning  the  asymptotic  behavior  of  solutions  we  prove  first  the  following  theo¬ 
rem. 

Theorem  3.1.  For  each  initial  data  U°  =  (v°,p°,  W°,  W1)  in  Xq  the  corresponding 
weak  solution  of  (2.1)  tends  asymptotically  towards  the  equilibrium  point  (0 ,b,ba(x),  0) 

where  b=^-(  [  p°  +  [  W0>)  and  a(x)  =  -( —x 2  +  x). 

13  \Jn  JTo  /  2 


Remark  3.  We  obtain  that  the  velocities  of  the  fluid  and  the  string  go  to  zero  whereas 
the  pressure  of  the  fluid  and  the  position  of  the  string  tend  to  some  functions  that  are 
uniquely  determined  by  the  initial  data.  Notice  that  the  pressure  stabilizes  around  a 
suitable  constant  while  the  asymptotic  deformation  of  the  string  is  a  parabola. 
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Proof.  The  main  tools  of  our  analysis  are  an  extension  of  the  well  known  Invariance 
Principle  of  La  Salle  and  Holmgren’s  Uniqueness  Theorem. 

Observe  first  that  it  is  sufficient  to  consider  only  initial  data  U°  =  (v°,p°,  W°,  W1) 
in  T>{A).  A  standard  density  argument  and  the  property  of  stability  (2.2)  enable 
us  to  complete  the  proof.  In  this  case  Theorem  2.1  gives  an  unique  strong  solution 
U(t)  =  (v,p,W,W')(t)  =  S(t)U°  for  the  equation  (1.5),  with  {U(t)}t>o  bounded  in 
'D(A).  Since  T>(A)  C  X0  with  compact  inclusion,  we  have  that  {U(t)}t>o  is  relatively 
compact  in  X0. 

We  now  describe  the  equilibrium  points  corresponding  to  our  problem.  These  are 
elements  Z  =  ( u,r,X,Y )  e  L>(A)  with  S(t)Z  =  Z  for  all  t  >  0.  It  follows  that  the 
equilibrium  points  are  characterized  by  the  system: 


Vr  =  0 

in  Q 

div  u  =  0 

in  fl 

u  ■  v  =  0 

on  T( 

Xxx  r  =  0 

on  r. 

d 

II 

o' 

* 

II 

T— 1 

Prom  (3.1)  we  deduce  that  the  equilibrium  points  are  (0,  b,  ba(x),  0),  where  b  is  a 
real  constant  and  a(x)  is  the  solution  of  the  differential  equation: 

/  -axx  -1=0,  x  G  (0, 1) 

\  o(0)  =  o(l)  =  0. 

On  the  other  hand  we  remark  that  the  energy  function  defined  by  (1.6)  is  a  Lya¬ 
punov  function  for  the  dynamical  system  defined  by  S(t)U°  =  U(t)  since  it  satisfies 
relation  (2.2).  We  prove  now  that  E(t)  is  a  strict  Lyapunov  function.  To  do  this  let 
Z°  =  (u°,  r°,  X°.  Y°)  €  X0,  Z(t)  =  (u,  r,  X,  Y)(t)  =  S(t)Z°  for  all  t  >  0  and  suppose 
that  the  energy  of  the  solution  Z(t)  is  constant.  Hence  Y(t)  =  0,  by  (2.2). 

It  follows  that  (u,  r,  X,  Y)  satisfies: 


v!  +  Vr  =  0 

in  Six  (0, oo) 

r'  +  div  u  =  0 

in  fl  x  (0,  oo) 

(3.2) 

u  ■  v  =  0 

on  dQ,  x  (0,  oo) 

I  Xxx  r  0 

on  To  x  (0,oo) 

O 

II 

HO 

II 

Ho' 

o' 

for  t  G  (0,  oo). 

Therefore: 

O 

II 

<1 

1 

X 

in 

(3.3) 

<  dr/dv  =  0 
(  r'  =  0 

on  dQ 

on  r0. 

We  can  apply  now  Holmgren’s  Uniqueness  Theorem  (see  [10],  Theorem  8.6.5,  p. 
309  and  [12],  Theorem  8.1,  p.  88)  which  implies  that  r’  =  0  in  U  x  (l,oo)  and  so 
r(t, x, y )  =  r{x, y)  infix  (1, oo). 

Prom  (3.3)  we  can  deduce  that  r  =  b  in  Q  x  (1,  oo)  where  b  is  a  real  constant. 
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Moreover,  from  (3.2),  it  follows  that  u  =  0  in  ft  x  (1,  oo)  and  X  is  solution  of  the 
equation: 

-Xxx  -6  =  0  on  T0  x  (1,  oo) 

X(0,i)  =  X(M)  =  0  for  t  E  (1,  oo). 

Taking  into  account  the  uniqueness  of  solutions  of  the  system  (3.2)  we  obtain  that 
Z°  =  (u0,r°,X°,Y0)  =  (0, 6,  ba(x),  0).  Hence  is  an  equilibrium.  Therefore  E(t)  is 
a  strict  Lyapunov  function. 

We  are  now  in  conditions  to  apply  La  Salle’s  Invariance  Principle. 

Let  now  U°  =  {v°,p°,  W°,  W1)  be  the  initial  data  for  (1.5).  By  La  Salle’s  Invariance 
Principle  it  follows  that  the  trajectory  tends  to  the  set  of  the  equilibrium  points  when 
the  times  goes  to  infinity.  Let  us  prove  that,  in  fact,  the  trajectory  converges  to  a 
unique  point. 

Integrating  the  second  equation  of  (1.5)  in  ft  we  deduce  that  the  quantity 
f  W°  is  constant  along  the  trajectory.  Since  the  equilibrium  points  are  of  the  form 

J r0 

(0,b,ba(x),0)  it  follows  that  the  corresponding  solution  of  (1.5)  tends  to  an  unique 
equilibrium  point,  the  one  for  which  6  =  —  (JQP°  +  jT  W"0)  .  □ 

Remark  4.  We  can  decompose  the  space  Xq  as  Xq  =  Xg  ©  Xg  ,  where: 

Xg  =  {(uV3,  w°,  V°)  ex0  :fnp°  +  Jr0  =  o}  , 

Xg  =  |(0,  b,ba(x),  0)  e  X0,  b  G  R}  . 

The  projection  of  the  solution  U{t)  of  (1.5)  on  Xg  is  a  constant  function  in  time 
whereas,  by  Theorem  3.1,  the  projection  on  Xg  tends  to  zero  as  t  goes  to  infinity. 

4.  The  lack  of  uniform  decay 

In  this  paragraph  we  prove  that  the  rate  of  decay  is  not  uniform.  Results  like  this 
are  typical  for  linear  hybrid  systems  in  which  the  dissipation  is  very  weak:  it  can 
force  the  strong  stabilization  but  it  cannot  ensure  the  uniform  decay. 

First  of  all  we  recall  that  a  strongly  continuous  semigroup  {S(t)}t> o  has  exponential 
decay  if  there  are  two  constants  u>  >  0  and  M  >  0  such  that 

(4.1)  ||5(t)||  <  M  exp  (— wf),  W  >  0. 

We  also  remark  that,  in  the  case  of  linear  semigroups,  the  exponential  decay  is  equiv¬ 
alent  to  the  uniform  decay.  Therefore,  if  a  linear  semigroup  {S'(i)}t>o  does  not  have 
exponential  decay  then  there  are  initial  data  U°  such  that  S(t)U°  decays  arbitrarily 
slowly  to  zero.  More  precisely,  if  tp  :  [0,  oo)  — >  oo  is  a  continuous  decreasing  func¬ 
tion  such  that  ip(t)  — >  0  as  t  — >  oo  then  there  exist  an  initial  data  U°  G  X0  and  a 
sequence  (tk)k>o  tending  to  infinity  such  that  ||S,(tfc)170||  >  tp{tk)  (see  [13]). 

When  the  Dirichlet  boundary  conditions  of  W  in  (1.5)  are  replaced  by  the  Neumann 
ones,  i.e.  if  W  is  assumed  to  satisfy 

Wx(0,t)  =  144,(1,0)  =  0,  t>0 


S.  Micu  and  E.  Zuazua 


215 


this  result  is  easy  to  show.  Indeed,  under  these  boundary  conditions  one  can  find  a 
sequence  of  solutions  {{iT„,pn,  Wn)}neN  of  the  type  {vn,pn,Wn)  =  e~Xnt(vn,p„,  Wn) 
in  separated  variables  such  that  TZeXn  — >  0  as  n  — ►  oo  (see  [17]).  However,  the  sepa¬ 
ration  of  variables  does  not  apply  with  the  boundary  conditions  we  are  considering. 

In  order  to  prove  that,  for  our  system,  there  is  no  uniform  decay  we  analyze  first 
a  conservative  problem.  Next,  using  the  fact  that  these  two  systems  are  very  close 
one  from  another  (in  a  way  that  we  shall  make  precise  later  on)  we  prove  the  desired 
property. 

We  consider  now  the  following  undamped  system  in  v,  p  and  W : 


'  if +  Vp  = 

0 

in 

fl  x  (0,  oo) 

p'  +  div  v 

=  0 

in 

n  x  (o,  oo) 

v  •  v  =  0 

on 

Ti  x  (0,oo) 

v-v  =  W' 

on 

r0  x  (o,  oo ) 

W"  -  wxx 

—  p  =  0 

on 

8 

X 

o 

wx(0,t)  = 

144(l,t)  =  0 

for 

t  G  (0,  oo) 

v(0)  =  v°, 

p(  0)  =  P° 

in 

n 

{  w{ 0)  =  W°,  W'( 0)  =  w1 

on 

T0. 

Remark  5.  Since  we  have  dropped  the  dissipative  term  W'  in  the  string  equation  the 
system  ( 4.2)  is  conservative.  On  the  other  hand  we  remark  that  the  Dirichlet  bound¬ 
ary  conditions  for  the  string  have  been  replaced  by  Neumann  boundary  conditions. 
This  will  allow  us  to  use  the  separation  of  variables  and  to  obtain  useful  informations 
about  the  eigenvalues  and  eigenfunctions  of  the  system.  We  do  this  in  Lemmas  4.1 
and  4.2. 

The  initial  data  (v°,p°,  W°,  W1)  is  considered  in  the  space  of  finite  energy: 

(4.3)  X  =  CxL2{tt)xHl{T0)xL2{T0), 

We  define  the  energy  associated  to  this  system  in  the  same  way  as  in  (1.6).  We 
also  define  in  X  the  unbounded  operator  (V(B),  B): 

B(v,p,  W,  V )  =  ( Vp ,  div  v,  -V,  -Wxx  -  p), 

V{B)  =  {U  =  {v,p,  W,V)€X  :  B{U)  G  X,  v  ■  v  =  0  on  Tu 
v  ■  v  =  V  on  To,  Wx( 0)  =  144(1)  =  0}  . 

Lemma  4.1.  The  operator  B  has  a  sequence  of  purely  imaginary  eigenvalues  (A ni)neN 
where  Xn  are  the  roots  of  the  equation: 

(4.4)  f  tan(  =  1. 

Proof.  We  look  for  a  sequence  of  solutions  {(u4,pn>  kkn)}neN  ^or  (4-2)  of  the  type 
(Vn,Pn,Wn)  =  e~Klt(un,rn,vn )  where  un  =  un{y),  rn  =  rn(y)  and  vn  G  M. 
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We  can  see  that  (4.2)  has  solutions  of  this  form  if  (u„,  rn,  vn)  satisfies 

(  —\n  i  un  +  Vrn  =  0  for  y  e  (0, 1) 

, ,  I  -(A n)2rn  ~  ( r„)yy  =  0  for  y  £  (0, 1) 

I  (r„)y(l)  =  o,  (r„)„(0)  =  -(\n)2vn 
{  (A„)2un  +  rn(0)  =  0. 

It  follows  that  rn(y)  =  cos  (A n(y  —  1)),  un  =  -r— rVr„  and  vn  =  ——  sin  A„  solves 

An  l  An 

(4.5)  if  A„  is  solution  of  the  algebraic  equation  (4.4). 

It  is  well  known  that,  for  each  n  e  N,  there  is  a  root  of  this  equation  which  belongs 

to  the  interval  (nix  -  nit  +  0 .  This  concludes  the  proof.  □ 


Remark  6.  A  very  similar  proof  allows  us  to  show  that,  if  in  the  string  equation  in 
system  (4.2 )  we  introduce  the  dissipative  term  W',  there  is  a  sequence  of  eigenvalues 
such  that  lZe  (An)  — >  0  as  n  — >  oo  as  we  mentioned  before.  This  implies  that  the  decay 
rate  of  the  associated  semigroup  is  not  uniform.  In  the  case  of  system  (1.5)  under 
consideration  it  is  difficult  to  show  directly  the  existence  of  such  solutions  since  we 
can  not  use  separation  of  variables. 

Remark  7.  The  roots  (An)„  of  the  equation  (4.4)  have  the  following  asymptotic  de¬ 
velopment: 

1  /  1  \ 

An  =  mr  H - h  O  ,  as  n -r  oo. 

nix  \n3J 

For  details  see  [19],  p.  12. 

To  each  eigenvalue  A ni  given  by  Lemma  4.1  it  corresponds  an  eigenfunction  £n 
defined  by: 

(  t— r  V  cos  (An(y  —  1))  \ 

A  n  1 

c  cos(An(y  —  1)) 

(4-6)  -  sin  An 

A  n 

\  i  sin  A„  / 

We  shall  denote  by  £fn,  j  G  {1, . . . ,  4),  the  components  of  £„. 

Lemma  4.2.  If  (^„)„  is  the  sequence  of  eigenfunctions  of  system  ( 4.2)  corresponding 
to  the  eigenvalues  (A ni)n  given  by  Lemma  4.1  then: 

i)  The  last  two  components  of  \n  tend  to  zero  when  n  tends  to  infinity. 

ii)  The  sequence  (£„)„  does  not  tend  to  zero  in  X  when  n  tends  to  infinity. 


Proof,  i)  Since  An  =  nir  +  —  +  O  ( A)  it  follows  that  (sin  A„), 
'  mx  Vn3/ 

when  n  tends  to  infinity. 


tends  to  zero  in 


ii)  We  simply  remark  that 


-  as  n  oo.  □ 
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Remark  8.  Lemma  4.2  shows  that  there  are  solutions  of  ( 4.2J  in  which  the  effect  of 
the  vibrating  string  vanishes  asymptotically.  This  indicates  that  the  boundary  condi¬ 
tions  for  the  string  are  not  very  important  at  high  frequencies.  Since  the  system  with 
Neumann  boundary  conditions  does  not  have  an  exponential  decay  (see  Remark  6) 
we  can  expect  that  this  will  be  the  case  for  system  (1.5)  too.  Indeed,  as  the  proof  of 
the  following  Theorem  shows,  the  solutions  of  (A2)  can  be  slightly  modified  in  order 
to  obtain  solutions  of  (1.5)  with  arbitrarily  small  exponential  decay  rate. 

The  main  result  of  this  paper  is  the  following: 

Theorem  4.3.  The  decay  rate  of  the  semigroup  {S'(f)}t>o  is  not  exponential  in  the 
space  X§. 


Proof.  We  shall  prove  the  theorem  by  contradiction.  Suppose  that  {5(f)}t>o  has 
exponential  decay  in  Xq.  i.e.  there  are  two  constants  u>  >  0  and  M  >  0  such  that: 

j[5'(i)||^t-o  <  A/  exp  ( — o^t),  Vi  >  0. 

Let  TZ(A  :  p)  be  the  resolvent  of  A  in  p,  H(A  :  p)  =  (A  —  //X)-1,  where  p  is  a 

roc 

complex  number  in  the  resolvent  set  of  A.  We  recall  that  TZ(A  :  p)  =  j  eMt5(i)  dt 
(see  [20],  Theorem  3.1,  p.  8).  Hence 

roo  _  roc 

(4.7)  \\K(A:p)\\xo<Jo  ene^\\S{t)\\xodt<  jf  Me^^dt. 

Since  the  operator  A  is  dissipative  we  have  that  the  resolvent  is  well  defined  from 
X°  to  V(A)  for  all  imaginary  numbers  p  (with  TZep  =  0).  In  this  case  we  obtain 
from  (4.7)  that  the  resolvents  are  uniformly  bounded: 

M 

(4.8)  |[77(.4  :  p)\\x°  <  —  for  all  M  with  TZep  =  0. 

We  shall  prove  that  there  exist  a  sequence  of  imaginary  numbers  (A„  i)n6N,  A„  G  R, 
and  a  sequence  of  functions  ($n)„eN  C  X°,  ll^lUo  =  1,  such  that 


(4.9) 


||77(^4  :  p)$n\\x°  — >  00  when  n  — >  oo. 


This  contradicts  (4.8)  and  the  proof  is  completed. 

In  order  to  do  this  let  (An  i)neN  be  the  sequence  of  eigenvalues  of  the  problem  (4.2) 
given  in  Lemma  4.1  and  let  (£re)neN  be  the  corresponding  eigenfunctions  given  by 
(4.6). 

Observe  that  £n  X°  because  the  third  component,  which  is  a  constant,  does  not 
belong  to  Hq(T0).  We  shall  “cut-off”  this  constant  function  in  order  to  get  a  slightly 
modified  one  in  Hq(T0). 

For  each  n  G  N  we  define  the  function  un  :  [0, 1]  — >  [—1, 1]  by: 


(4.10) 


(-|A„|x  +  l)ep!»"*-i, 

|An|(l-a) 

(|A„|x  -  |A„|  +  l)ei*niu-*>-i, 


0 


[ixe 

otherwise, 
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and  Vn  =  in~  Vv 


and  the  function  hn  as  the  solution  of: 

-A hn  +  hn  =  0  in  Q 

(Jhn  ft  p 

(4.ii)  &7_0  1 

=  (sin  A„)  un  on  T0. 

OP 

Let  now 

/  Vhn  \ 
sin  An  n 

tbn  =  .  and  Tn  =  fn~  ipn- 

rn  smA„ 

A„  Un 

^  —i  sin  A„  un  , 

From  the  definitions  of  the  functions  un  and  hn  it  follows  that  ipn  G  T>{A)  fl  X®  for 
all  n  G  N. 

F,„all,!et<l»„=|[(4_An.I)t)i_|k. 

We  obtain  that: 

^  :  An  *)  =  ||(<A_Antnj)y,n||A?- 

and  we  want  to  prove  that  (4.9)  holds. 

Since  we  need  more  information  about  the  norms  of  ipn  and  (.4  -  A„  il)tpn  we  shall 
prove  first  some  properties  of  the  functions  un  and  hn. 

Lemma  4.4.  The  functions  un  and  hn  defined  by  (4.10)  and  (4.11)  respectively  have 
the  following  properties: 

i)  \K\\h = o 

ii) \\(UnU\h  =  0((Xn)3). 

in)  ||hn||Hi  —  O  ^3^  ' 

iv)  [|Ahn||^2  =  ||h„||^,2  =  O  (  .J\  . 


Hi)  ||h„||K 


Proof,  i)  From  (4.10)  we  obtain 

IKIILcu)  =  2  <  2  jfW(-|A„l*  +  1) 


ii)  We  have  that 


r  ■  a1  ,  1  2]A— n|x 

||(«n)^|||2(0ii)  =  2|An|4^  (|An|x  —  l)4  e'[Xn]X~1 

=  2'An'3  Jo  (s  -1  If  ds  ~ 2c  'An'3’ 


Jo  (s- 1)4  ^  1  ’ 

where  c  =  f  1 — j  ds  is  a  constant  which  does  not  depend  on  Xn. 

Jo  ( s  —  l)4 
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iii)  Prom  (4.11)  we  deduce  that,  for  all  8  >  0, 


IIMIff^fi)  +  ll^nlll^n)  - 


sin  A„  I  un  hn 
JTo 


< 


I  sin  A„,|  (\ 


( T  [  \u„\2  +  8  [  \hn\2') 
\u  JTo  JTo  / 


< 


I  sin  A„ 


OL  \un\2  +  8c\\hn 


Taking  8  = 


l 


|sm  An  I  C 


we  obtain  that 


||^n|||fi(n)  <  c|sinAn|2  [  |un|2. 

*/r0 


Here  c  is  a  generic  positive  constant  that  may  vary  from  line  to  line. 

Since  sinAn  =  O  ^  and  J  \un\2  =  O  by  i),  iii)  follows, 
iv)  We  simply  observe  that 

II  —  Ahn||x,2(n)  =  ||h„||I,2(Q)  <  ||/in||/ji(n) 
and  use  iii).  The  proof  of  the  Lemma  is  now  completed.  □ 

In  order  to  complete  the  proof  of  the  theorem  we  estimate  \\(A- XniI)(fin\\xo  and 
1 1  <pn  |  |*o  when  n  tends  to  infinity. 

Observe  first  that,  by  Lemma  4.4  i),  we  have 


(4.12)  ll^nlU"  >  ||£nlU2(fi) 

On  the  other  hand 


sin  Xn 


Xn  ( 1  1 

—  I  Un  — >  X 

n  Jo  2 


as  n  — >  oo 


/ 


(^4  An  i  T)ipn 


XJhn 


-A hn  +  sin  A„  [  un 
Jo 

0 

A„  sinA„  r1  I 

\  \  \U'n)xx  “1“  ^ n  Sill  An  lLn  J 

\  An  JO  / 


sin  A„ 


We  obtain  that 

\\{A-Xnil)(pn\\2xo  <  ||hn||^i(fi)  4-  2||A/in||22({2) 


+ 


2sin  An 


(l  I  (.U"n)xx  |  lr,2(ro)  +  lluJillL2(r0))  +  4|sin  A„|2||ttn|||2(ro). 


Taking  into  account  the  results  of  Lemma  4.4  and  the  fact  that  sin  A n  =  O 
obtain  that 


we 


(4.13)  \\(A  —  XniI)ipn\fxo  — >  0  when  n — >  oo. 

The  last  result  together  with  (4.12)  contradicts  (4.8).  So  the  assumption  that 
{S{t)} t>0  has  exponential  decay  must  be  false  and  the  proof  is  completed.  □ 
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Remark  9.  Analyzing  the  exponential  stability  of  the  classical  wave  equation  with  dis¬ 
sipation  on  the  boundary 

{u"  —  A u  =  0  in  Q  x  (0,  oo) 

^  +  u'  =  0  on  T0  x  (0,  oo) 
u  =  0  on  Ti  x  (0,  oo) 

Bardos,  Lebeau  and  Rauch  in  [6]  prove  that  if  one  characteristic  ray  escapes  to  the 
dissipative  region  T0  we  can  construct  solutions  with  an  arbitrary  decay  rate  and  with 
the  energy  concentrated  along  this  ray.  In  our  case  every  segment  {(x,yo)  :  x  £ 
(0, 1)},  for  any  y0  £  (0, 1),  constitutes  a  ray  with  such  a  property  and  their  argument 
could  be  applied  as  well. 

Nevertheless  the  proof  of  Theorem  4.3  shows  that  we  can  find  a  sequence  of  solutions 
of  (1.5)  with  the  energy  uniformly  distributed  in  all  Q  and  with  arbitrarily  small 
exponential  decay  rate.  Indeed,  if  ($„)n  is  the  sequence  considered  in  the  proof,  let 
(S(t)$n)n  be  the  sequence  of  corresponding  solutions  of  (1.5).  By  ( 4.7)  we  have  that 

/•OO 

||rcM:ABi)*„||A?<  l  \\Smn\\xodt. 

If  (S(t)$n)n  had  an  uniform  exponential  decay  rate,  for  example, 

||S,(t)$n||A'0  <  Mexp(-u)t),  then 

M 

ll^-("4  :  —  ~ 

which  is  not  true  since  (4$)  holds. 

Therefore  the  lack  of  uniform  decay  of  our  system  is  of  a  different  nature  and  is 
related  not  only  to  the  support  of  the  dissipative  mechanism  but  also  to  the  nature  of 
the  boundary  conditions  or  of  the  coupling  between  the  different  components  of  the 
system. 

Remark  10.  We  mention  that  in  the  proof  of  Theorem  4.3  we  may  start  with  solutions 
(v*n,Pn,  W„)„eN  of  (4.2)  of  the  type  (vn,pn,Wn)  =  e~Kit(un,rn,vn)  cos(mwx)  where 
un  =  un(y),  rn  =  rn(y),  and  an  arbitrary  m  £  N.  Therefore  we  can  find  a 

sequence  of  solutions  of  (1.5)  with  arbirary  exponential  decay  rate  and  with  a  fixed 
frequency  of  vibration  in  the  x—  direction  (m  G  N  fixed).  This  is  due  to  the  fact 
that  the  one- dimensional  problems  obtained  by  separating  the  variable  x  do  not  have 
an  exponential  decay  for  m  fixed.  This  is  an  important  difference  with  respect  to 
system  (4.14)  in  which  the  exponential  decay  holds  if  the  frequency  of  vibration  in  the 
x— direction  is  fixed,  but  with  a  decay  rate  that  vanishes  as  m  — >  oo. 

5.  Comments 

In  [3]  a  two-dimensional  model  is  presented  in  which,  on  the  subset  T0  of  the 
boundary,  an  Euler-Bernoulli  beam  with  fixed  ends  is  considered.  The  methods 
developped  in  this  paper  can  be  adapted  to  this  type  of  problems  too. 

The  results  of  Sections  2  and  3  can  be  generalized  to  similar  models  in  other 
domains.  For  instance,  if  is  a  bounded  open  set  in  R2  with  smooth  boundary  and 
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r0  is  an  open  subset  of  the  boundary  of  the  domain,  one  can  replace  in  (1.5)  the  wave 
equation  satisfied  by  W  by 

d2W 

W"  -  — -  +  W'-p  =  0  on  r0  x  (0,  oo) 

where  is  the  derivative  in  the  tangential  direction. 

The  results  of  Section  4  may  be  extended  to  some  particular  geometries.  For 
instance,  in  [15]  we  analyze  the  case  in  which  fi  is  a  ball  of  M2  and  the  dissipative 
term  acts  on  the  whole  boundary  of  Q.  We  obtain  that  the  corresponding  system 
does  not  have  exponential  decay.  This  indicates  something  we  already  pointed  out 
in  Remark  9:  the  lack  of  uniform  decay  in  this  type  of  systems  is  due  to  the  hybrid 
structure  and  not  to  the  localization  of  the  dissipation  in  a  relatively  small  part  of 
the  boundary.  Although  this  model  does  not  have  much  physical  meaning,  all  the 
techniques  we  used  there  can  be  adapted  to  the  case  of  a  cavity  enclosed  by  a  thin 
cylindrical  shell  which  is  much  more  realistic  (see  [4]). 

Acknowledgments:  The  first  author  wishes  to  thank  all  organizers  of  the  Project 
MATAROU  TEMPUS  JEP  2797  and  especially  Professor  Doina  Cioranescu  for  their 
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Dirichlet  Boundary  Control  of  Parabolic  Systems 
with  Pointwise  State  Constraints* 
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Abstract.  In  this  paper  we  study  an  optimal  control  problem  for  linear  parabolic  systems 
with  pointwise  state  constraints  and  measurable  controls  acting  in  the  Dirichlet  bound¬ 
ary  conditions.  Using  the  framework  of  mild  solutions  to  parabolic  systems  with  nonregular 
dynamics,  we  prove  a  general  existence  theorem  of  optimal  controls  and  derive  necessary  op¬ 
timality  conditions  for  the  state-constrained  problem  under  consideration.  Our  variational 
analysis  is  based  on  a  well-posed  penalization  procedure  to  approximate  state  constraints 
and  then  to  study  a  parametric  family  of  approximating  problems.  The  final  result  estab¬ 
lishes  necessary  optimality  conditions  for  the  original  state-constrained  problem  by  passing 
to  the  limit  from  approximating  problems  under  a  proper  constraint  qualification. 

1991  Mathematics  Subject  Classification.  49K20,  49J20,  35K50,  93C20 

Key  words  and  phrases.  Approximation,  pointwise  state  constraints,  constraint  qualifica¬ 
tion,  Dirichlet  boundary  controls,  parabolic  equations,  and  mild  solutions. 


1.  Introduction 

This  paper  is  devoted  to  optimal  control  of  parabolic  systems  with  nonregular 
Dirichlet  boundary  conditions  and  pointwise  state  constraints.  It  is  well  known  that 
the  Dirichlet  boundary  control  case  is  the  most  challenging  and  the  least  developed 
since  such  conditions  offer  the  lowest  regularity  properties  of  the  parabolic  dynamics; 
cf.  [1],  [2],  [5]— [12],  [17],  and  references  therein.  The  presence  of  pointwise  state  con¬ 
straints  brings  an  additional  nonsmoothness  to  optimal  control  problems  and  requires 
the  development  of  special  methods  for  their  variational  analysis. 

In  this  paper  we  provide  such  an  analysis  based  on  the  theory  of  mild  solutions 
to  nonregular  parabolic  systems  and  well-posed  smooth  approximations.  Crucial 
elements  of  this  analysis  and  the  corresponding  results  have  been  presented  in  [14]— 
[16]  for  certain  special  cases  of  the  problem  under  consideration  related  to  minimax 
control  in  uncertainty  conditions. 

In  this  paper  we  consider  a  general  Dirichlet  boundary  control  problem  with  a 
nonlinear  integral  cost  functional  involving  the  final  state  of  the  n-dimensional  lin¬ 
ear  parabolic  equation.  Under  natural  assumptions  we  prove  the  existence  of  op¬ 
timal  controls  and  necessary  optimality  conditions  in  the  presence  of  magnitude 
control  and  state  constraints.  To  obtain  necessary  optimality  conditions  for  the 
state-constrained  problem  we  develop  a  constructive  penalization  procedure  involving 
smooth  approximations  of  multivalued  maximal  monotone  operators.  We  establish 

*This  research  was  partly  supported  by  the  National  Science  Foundation  grant  DMS-9404128 
and  the  USA-Israel  BSF  grant  94-00237. 
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the  well-posedness/ 'strong  convergence  of  approximations  in  appropriate  spaces  and 
derive  necessary  optimality  conditions  for  approximating  solutions.  Finally,  neces¬ 
sary  optimality  conditions  for  the  original  state-constrained  problem  are  established 
under  a  proper  constraint  qualification  which  is  different  from  the  standard  Slater 
interiority  type. 

The  paper  is  organized  as  follows.  In  Section  2  we  formulate  and  discuss  the 
Dirichlet  boundary  control  problem  of  our  study,  present  preliminary  results  from  the 
theory  of  mild  solutions,  and  prove  a  general  existence  theorem  of  optimal  controls. 
Section  3  concerns  with  the  development  and  justification  of  the  main  approximation 
procedure;  it  contains  convergence  results  as  well  as  necessary  optimality  conditions 
for  approximating  solutions.  In  the  final  Section  4  we  furnish  a  limiting  process 
to  derive  necessary  optimality  conditions  for  the  original  state-constrained  problem 
under  a  proper  constraint  qualification. 

2.  Problem  Setting  and  Existence  of  Optimal  Solutions 

Let  D  C  RN  be  an  open  and  bounded  domain  whose  boundary  T  is  an  (n  - 
1)— dimensional  manifold.  With  T  >  0  we  set  Q  :=  (0 ,  T)  x  Q.  and  E  :=  (0,T]  x  F. 
Let  A  be  a  second-order  uniformly  strongly  elliptic  operator  on  D  given  in  the  form 

a  ■=  -  + %ai[x)h + %(i) 

with  the  smooth  real- valued  data  aij(x),  aj(z),  and  oo(z). 

We  consider  the  following  Dirichlet  boundary  control  system  for  linear  parabolic 
equations 

'yt  +  Ay  =  f  a.e.  in  Q, 

(2.1)  •  y(0,x)  =  y0{x),  x  G  Q, 

y{t,x)  =  u(t,x),  (t,x)  G  E 

where  yt  denotes  the  derivative  of  y  with  respect  to  time  t,  f  G  L°°(Q),  and  yo{x)  G 
Hq(D)  fl  H2(D).  In  what  follows  we  impose  pointwise  state  and  control  constraints  of 
the  magnitude  type: 

a  <  y(t,x)  <  b  a.e.  (t,x)  e  Q, 
c  <  u(t,x )  <  d  a.e.  (t,x)  G  E 

where  both  intervals  [a,  6]  and  [c,  d]  contain  0. 

We  say  that  u  is  a  feasible  control  to  system  (2.1)  if  the  corresponding  trajectory 
y  satisfies  the  state  constraints  (2.2).  We  always  assume  that  system  (2.1)  admits  at 
least  one  feasible  control  u. 

Denote  by 

Uad  '■=  {u  €  17(0,  T;  L2{ r))  |  c  <  u(t,  x)  <  d  a.e.  (t,  x)  G  E} 

the  set  of  admissible  controls  where  p  is  a  positive  number  that  will  be  specified 
later.  In  the  sequel  the  solution  (trajectory)  y  of  (2.1)  corresponding  to  u  G  Uad  is 
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understood  in  the  mild  sense',  cf.  [1],  [8]— [11] ,  [18].  This  means  that  y  :  [0,  T]  — >  L~(Q) 
is  continuous  and  admits  the  following  Cauchy-like  representation: 


(2.3) 


y(t)  =  S{t)y0  +  S(t  —  r)/(r)dr 

+  f  A3/4+sS(t  -  T)A1/4-sDu{T)dT  VS  G  (0, 1/4] 


where  S(-)  is  the  strongly  continuous  analytic  semigroup  generated  by  the  operator 
-A,  /(•)  G  L°°(Q),  and 

D  :  L\T)  —  V(A1/4-6)  =  H1/2~2S(fl) 


is  the  so-called  Dirichlet  map.  The  latter  operator  is  defined  by  z  =  Du  through  the 
solution  of  the  elliptic  boundary-value  problem 


j  —Az  =  0  in  Q, 

1  z(t,x)  —  u(t,x),  ( t,x )  G  S. 


It  is  well  known  that  the  Dirichlet  map  is  continuous  for  6  G  (0, 1/4]  and,  moreover, 
system  (2.1)  has  a  unique  mild  solution  for  each  u  G  Uad  when  p  is  sufficiently  large; 
see,  e.g.,  [11]  and  [16]  for  more  discussion  and  references. 

Note  that,  being  a  L2(D)-valued  function,  y(-)  =  y(t,x)  is  merely  measurable  with 
respect  to  (t,x).  This  lack  of  continuity  creates  certain  technical  difficulties  to  deal 
with  nonregular  Dirichlet  boundary  conditions.  Nevertheless,  mild  solutions  provide 
a  reliable  ground  to  study  optimal  control  problems  involving  such  conditions. 

Let  us  consider  the  performance  index  (cost  functional)  given  by 


(2.4) 


J{u,y)  '.=  J^ip(y(T, x))dx  +  IL  g(t,x,y(t,x))dtdx 

+  JJ  h(t,x,u(t,x))dtdax 


where  ax  is  the  Lebesgue  measure  on  T.  Observe  that  the  first  term  in  (2.4)  depends 
on  the  final  state  of  (2.1)  that  creates  additional  difficulties  in  the  framework  of 
nonregular  Dirichlet  boundary  conditions;  see,  e.g.,  Chapter  3  of  [12]. 

Throughout  the  paper  we  impose  the  following  hypotheses  on  the  integrands  in 
(2.4): 

(i)  ip  G  C1(M)  and  there  is  a  nonnegative  function  ki  G  L2( R)  as  well  as  a  constant 
Ci  >  0  such  that 

|<p'(z)|  <  ki(z)  +  ci\z\  Vz  G  R. 


(ii)  g  is  measurable  in  ( t,x ),  continuous  in  y,  and  \g(t,x,y)\  is  majorized  by  a 
L1(Q)-function  for  all  y  G  [a,  6].  In  addition,  is  measurable  in  ( t,x )  for 

any  y  G  R.  and  there  is  a  nonnegative  function  k2  G  L2(Q)  as  well  as  a  constant 
c2  >  0  such  that 
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(iii)  h  is  measurable  in  (f,  x),  convex  and  continuous  in  u,  and  bounded  from  below 

by  a  L1(E)-function  for  all  u  £  [c,  d],  In  addition,  —  is  measurable  in  (t,x) 

for  any  and  there  is  a  nonnegative  function  £  Lg(E)  (1  /p+  1  jq  —  1) 

such  that 

dh 

\—(t,x,u)\<k3(t,x)  a.e.  (t,x)  £  E,  Vu  £  [c, d] . 

The  main  concern  of  this  paper  is  the  following  optimal  control  problem: 

(P)  minimize  the  cost  functional  (2.4)  over  the  Dirichlet  boundary  control  system 
(2.1)  subject  to  u  £  Uad  and  the  state  constraints  (2.2). 

The  first  question  we  consider  is  the  existence  of  optimal  controls  to  problem  (P). 
To  establish  a  general  theorem  in  this  direction  as  well  as  other  convergence  results 
for  mild  solutions  of  (2.1)  we  are  going  to  employ  certain  continuity  properties  of  the 
linear  operator 

(2.5)  Cu  =  (Cu)(t)  :=  [  A3/4+6S(t  —  t)A1/4~~sDu(t)cIt 

Jo 

from  17(0, T;  L2(F))  into  Lr(0,T;H1^2~s(Q))  where  p,  r  £  [1,  co] ,  8  £  (0,1/4],  and 
e  6  (0,1/2].  Here  ff1/'2~s(Q)  c  L2(fl)  is  the  Sobolev  space  whose  norm  ||s/||i/2— e» 
being  stronger  than  can  be  defined  by  ||y||i/2-£  :=  ||>l1/4~£/2 ||L2(n);  cf.  [12]. 

Note  that  H°(Q)  =  L2(Q).  When  t  =  T,  we  use  CT  to  denote  (2.5). 

The  following  assertion  was  proved  in  [16],  Proposition  3.1,  based  on  estimates  in 
Washburn  [18]  and  Lasiecka-Triggiani  [9].  Similar  but  somewhat  different  properties 
were  established  in  [10]. 

Proposition  1.  Let  p  >  4/e  with  e  £  (0, 1/2].  Then  the  operator  £  :  17(0,  T;  L2(r)) 
— »  C(\0,T]\  H1/2~e(Ll))  is  linear  and  continuous.  Moreover,  the  operator  £T  :  17(0,  T; 
L2(r))  — >  if1/2_e(n)  is  also  continuous  and  its  adjoint  operator  Cf  :  H~l^2+e(£t)  —* 
L9(0,T-,  L2(r))  (1/p  +  1/q  =  1)  is  given  by  C*T  =  (AS (T  -t)D)* . 

The  next  assertion,  proved  in  Proposition  3.4  of  [16],  is  crucial  in  passing  to  the 
limit  in  approximation  procedures  throughout  the  paper. 

Proposition  2.  Let  p  >  4/e  with  e  £  (0, 1/2).  Then  the  weak  convergence  of  un  — >  u 
in  17(0,T\L2(Y))  implies 

£un  — »  Cu  strongly  in  L2(Q)  as  n  — >  oo. 

In  what  follows  we  always  assume  that  p  is  sufficiently  large  to  ensure  the  conver¬ 
gence  property  in  Proposition  2  with  some  e  £  (0, 1/2).  Now  we  can  formulate  and 
prove  the  existence  of  optimal  controls  in  (P). 

Theorem  3.  Under  the  assumptions  made  above  there  exists  an  optimal  solution 
(u,y)  £  Uad  x  C([0,T]-Hll2-e(n))  to  the  Dirichlet  boundary  control  problem  (P). 

Proof.  Let  (un,  yn),  n  =  1, 2, . . . ,  be  a  minimizing  sequence  of  feasible  controls  u„  in 
(P).  For  each  n  =  1,2,...  we  consider  the  corresponding  mild  solution  yn  of  system 
(2.1)  that  is  uniquely  defined  by  un  and  belongs  to  the  space  C7([0, T\;  H1<'2~S(Q)) 
where  e  is  any  given  number  in  (0, 1/2]  (this  easily  follows  from  Proposition  1).  We 
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always  take  e  <  1/2  to  ensure  the  convergence  property  in  Proposition  2  with  large  p. 
Since  {un}  C  Uad  is  weakly  compact  in  1^(0,  T;  L2(r)),  there  exist  a  control  u  E  Uad 
and  a  subsequence  of  {un},  still  labelled  as  {un},  such  that 

un  — ►  u  weakly  in  Lp(0,  T;L2(P))  as  n  — >  oo. 

Proposition  1  ensures  that  operator  (2.5)  acting  from  1/(0,  T;  i2(P))  into  C([0,T]; 
i/1/2"£(f2))  is  weakly  continuous.  By  (2.3)  this  implies  that 

yn->y  weakly  in  <?([0,T];  i/1/2_£(H)) 

where  y  is  a  mild  solution  of  (2.1)  corresponding  to  u.  Now  employing  Proposition 
2,  we  conclude  that 

yn  — >  y  strongly  in  L2{Q)  as  n  — >  oo. 

The  latter  ensures  the  existence  of  a  subsequence  {ynk}  C  {yn}  with 

yUk(t,x)  — >  y(t,x)  a.e.  ( t,x)eQ  as  k  -+  oo. 

Such  a  pointwise  convergence  implies  that  the  limiting  trajectory  y  satisfies  the  state 
constraints  (2.2)  since  each  yn  has  this  property.  Therefore,  u  is  a  feasible  control  to 

(P)- 

To  prove  the  optimality  of  u  in  (P)  we  invoke  the  well-known  fact  that  due  to 
(iii)  the  last  term  in  (2.4)  is  a  weakly  lower  semicontinuous  functional  in  the  space 
Lp(0,T;L2(r)).  Furthermore,  the  Lebesgue  dominated  convergence  theorem  allows 
us  to  pass  to  the  limit  under  the  integral  signs  in  the  first  and  second  terms  of  (2.4), 
due  to  the  pointwise  convergence  of  ynic  — >  y  and  assumptions  (i)  and  (ii).  Therefore, 

J{u,y)<  lim  inf  J ( unk ,  y„k ) 

k—>  oo 


that  proves  the  optimality  of  u  in  (P).  □ 

Remark.  We  do  not  need  smoothness  assumptions  on  <p,  g,  and  h  to  prove  the 
existence  of  optimal  controls  in  Theorem  3.  The  most  essential  requirements  for 
this  are  the  convexity  of  h  in  u  and  the  right  choice  of  p  and  e  ensuring  the  conver¬ 
gence/continuity  properties  in  Propositions  1  and  2.  However,  we  use  the  smoothness 
assumptions  in  the  subsequent  sections  to  derive  necessary  optimality  conditions.  To 
simplify  the  exposition  we  have  combined  all  the  assumptions  together. 

3.  Necessary  Optimality  Conditions  in  Well-Posed  Approximations 

In  this  section  we  develop  a  well-posed  approximation  procedure  allowing  us  to 
remove  the  state  constraints  in  (P).  We  establish  an  appropriate  strong  convergence 
of  approximations  and  derive  necessary  optimality  conditions  for  approximating  so¬ 
lutions.  The  latter  results  can  be  viewed  as  suboptimality  conditions  for  the  state- 
constrained  problem  (P)  being  the  base  to  obtain  necessary  optimality  conditions  for 
(P)  in  the  next  section. 
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Let  a  :  M  =>  E  be  a  multivalued  maximal  monotone  operator  of  the  form 

[0,  oo)  if  r  =  b 
(—00, 0]  if  r  =  a 
0  if  a  <  r  <  b 

0  if  either  r  <  a  or  r  >  b. 


a(r)  = 


Using  the  Yosida  approximation  7  1(r  —  (1  +  70;)  V)  ofa(-)  and  then  a  C^-mollifier 
in  M,  we  may  choose  a  smooth  approximation  of  a(-)  as 


(3.1) 


a7(r)  =  \ 


7  1(r  —  b)  —  1/2 
(272)-i(r  _  6)2 
7-1(r'  -  a)  +  1/2 
—  (272)_1(r  —  a)2 
0 


if  r  >b  +  7 
if  b  <r  <b  +  7 
if  r  <  a  —  7 
if  a  —  7  <  r  <  a 
if  a  <r  <b 


with  the  property  |7cc^(r)|  <  1  for  all  r  €  R  and  7  >  0;  cf.  [2],  p.  322. 

Let  (u,  y)  be  a  given  optimal  solution  to  problem  (P).  We  consider  the  following 
parametric  family  of  boundary  control  problems  without  state  constraints: 

(P7)  minimize  J7(u,y )  :=  J(u,y)  +  ||u  -  u\\pLnoT.L2{T))  +  tIK^IIlW;^)) 
over  u  E  Uad  subject  to  system  (2.1). 

We  are  going  to  study  problems  (P7)  from  the  three  perspectives:  existence  of 
optimal  solutions,  their  convergence  to  (u,  y)  as  7  — >  00,  and  necessary  optimality 
conditions  for  them  as  7  >  0.  The  next  proposition  answers  the  first  question. 


Proposition  4.  Let  p  >  4/s  with  e  E  (0, 1/2).  For  each  7  >  0  problem  (P7)  has  at 
least  one  optimal  solution  (u7,y7)  E  Uad  x  (7([0,  T]; 


Proof.  The  set  of  feasible  solutions  to  (P7)  is  not  empty  since  it  obviously  contains 
(u,  y)  for  any  7  >  0.  First  we  should  check  that  the  cost  functional  in  (P7)  is  proper, 
i.e., 


(3.2)  J~,(u,y)  >  -00  V7  >  0 

for  all  feasible  solutions  ( u,y )  to  (P7).  It  easily  follows  from  assumptions  (i)— (iii) 
that 

J{u,y)  +  ||u  —  >  — 00. 

To  establish  (3.2)  it  remains  to  show  that 


(3.3) 


\\a7(y)  |U2(0,T;L2(n))  <  00  V7  >  0. 


Taking  into  account  the  definition  of  mild  solutions  (2.3)  and  estimates  in  [9]  and 
[18],  one  gets 


||2/(£)IU2(fi)  <  M(1  + 


max 


{|c|,d}\/meas(r)  , 

- - - 1  *  ) 


1-46 


for  any  fixed  6  E  (0, 1/4) 


with  some  constant  M  >  0.  Due  to  (3.1)  the  latter  implies  (3.3)  and  hence  (3.2);  cf. 
the  proof  of  Proposition  4.1  in  [16]  for  more  details. 

Now  arguing  as  in  the  proof  of  Theorem  3  and  using  Propositions  1  and  2,  we 
conclude  that  the  cost  functional  in  (P7)  is  weakly  lower  semicontinuous  on  the  set 
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of  feasible  controls  Uad  which  is  weakly  compact  in  1^(0,  T;  L2(T)).  Therefore,  the 
existence  of  optimal  controls  in  (P7)  follows  from  the  classical  Weierstrass  theorem. 

□ 

Next  we  establish  the  well-posedness  of  the  approximation  procedure  under  con¬ 
sideration  proving  the  strong  convergence  of  optimal  solutions  in  (P7)  to  the  given 
optimal  pair  (u,y)  in  the  original  problem  (P). 

Theorem  5.  Let  ( u,y )  be  the  given  optimal  solution  to  (P)  and  let  {(u7,r/7)}  be  a 
sequence  of  optimal  solutions  to  the  approximating  problems  (P7).  Then  there  is  a 
subsequence  {7*,}  C  {7}  such  that 

ulk  — >  u  strongly  in  Lp(0,T-,L2(T)),  y^k  — *  y  strongly  in  C([Q,T]]Hl^~£(Q)), 
and  Jyk(u-yk,y^k)  -»  J{u,y)  as  k  — ►  00. 

Proof.  Since  (u,  y)  is  feasible  to  (P7)  for  each  7  >  0,  one  has 

(3.4)  J7(u7,?/7)  <  J7(u,y)  =  J(u,y)  V7  >  0. 

Due  to  (3.4)  and  assumptions  (i)-(iii)  we  get 

711^7(2/7) lli2(o,r;i2(n))  ^  M  V7  >  0 
for  some  constant  M.  This  yields 

(3.5)  7lla7(2/7)IU2(o,T;L2(n))  — 0  as  7  — >  0. 

Since  Uad  is  weakly  compact  in  the  reflexive  Banach  space  7/(0,  T;  L2(T)),  there 
exists  a  subsequence  of  {u7},  still  denoted  by  {-tz7},  such  that 

(3.6)  u7  — >  u  weakly  in  Lv{ 0,  T ;  L2(r))  as  7  — >  0 

for  some  u  e  Uad-  Denote  by  y  a  mild  solution  of  (2.1)  corresponding  to  u  and 
employing  Proposition  2,  one  can  find  a  subsequence  {7*,}  C  {7}  such  that 

(3.7)  ylk(t,x)  — >  y(t,x)  a.e.  in  Q  as  k  — >  00. 

To  pass  to  the  limit  in  (3.4)  we  need  to  show  that  y  satisfies  the  state  constraints 
(2.2).  For  this  purpose  let  us  consider  the  following  sets: 

fiL  :=  {x  G  Q.  |  a -7  <  y(t,x)  <  a};  £ f2a  :=  {x  €  Q.  \  y(t,  x)  <0-7}; 

Di;,  :=  {x  £  f2  |  6  <  y(t ,  x)  <b  +  7};  Ll\b  :=  {x  €  £2  |  y(t,  x)  >b  +  7}. 

They  are  Lebesgue  measurable  due  to  the  choice  of  y  €  C([0,  T];  1 T1/2_£(n)).  Taking 
into  account  (3.5)  and  the  structure  of  a7(-)  in  (3.1),  one  has 

[  (  (2"/)~2(y1(t,x)  -  a)4dtdx+  [  [  (yJt,x)  -  a  +  ^jTfdtdx 
Jo  Jn\a  Jo  Jei\a 

+  [  [  {2'y)~2{y1{t,x)  —b)4dtdx 
Jo  Jn\b 

+  [  f  (yJt,  x)  —  b  -  "f/2)2dtdx  — >  0  as  7  — >  0. 

Jo  Jnt. 
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Applying  Lemma  4.2  of  [16]  similarly  to  the  proof  of  Theorem  4.3  therein,  we  conclude 
that 

a  <  y(t,x)  <  b  a.e.  in  Q, 

i.e.,  (ft,  y)  is  a  feasible  pair  to  the  state-constrained  problem  (P).  The  latter  yields 

(3.8)  J(u,y )  >  J{u,y). 

Now  passing  to  the  limit  in  (3.4)  and  taking  into  account  (3.6)-(3.8)  as  well 
as  the  weak  lower  semicontinuity  of  the  cost  functional  (2.4)  in  the  control  space 
17(0,  T;  L2(r)),  we  arrive  at 

(3.9)  hm  \\ulk  -  ft|IIP(0iT;z,2(r))  =  0  and  hrn  lk\\o‘'ik(,ylk)\\h(o,T-,L2{n))  =  °- 

The  first  equality  in  (3.9)  means  that  ulk  — >  u  strongly  in  1/(0, T]  L2(T))  as  k  — +  oo. 
By  Proposition  1  this  implies  that  y7k  — »  y  strongly  in  C'QO,  T\\  H1/2^ (0))  as  k  — >  oo. 
Therefore,  one  has  u  =  u  and  y  =  y.  Finally,  the  cost  functional  convergence  in  the 
theorem  follows  from  the  second  equality  in  (3.9).  □ 

The  last  result  of  this  section  provides  a  necessary  condition  for  an  optimal  control 
u7  to  each  approximating  problem  (P7).  This  condition  is  expressed  in  terms  of  the 
adjoint  operators  to  C  and  CT  considered  in  Proposition  1. 


Theorem  6.  Let  (u7,  j/7)  be  an  optimal  pair  to  problem  (P7).  Then  one  has 

0  <  JJj(C*Ttp'(y7))(t,x)  +  C*(^(t,x,y7)  +  27o4(y7)oy(i/7)) 

(3.10)  » 

+  —  (t,  x,  u7)]udtdax  +  2 p  j ^  ||u7  -  u J^(u7  -  u)udax)dt 


for  any  u  g  1/(0,  T;  L2(P))  such  that  u7  +  9u  g  Uad  for  all  6  g  [0, 0O]  with  some 
0O  >  0. 


Proof.  Consider  variations  of  u7  of  the  form  u7  +  9u  g  Uad  with  u  E  1/(0,  T;  L2(F)) 
where  0  g  [0, 60]  for  some  0O  >  0.  Denote  by  y7U  a  mild  solution  of  (2.1)  corresponding 
to  u7  +  6u  and  consider  a  function  if  :  [0, 0O]  — 1 ►  IK  defined  by 

ip(9)  :=  J7(u7  +  6u,  y7U). 

Clearly  ip  attains  its  minimum  at  6  =  0.  Moreover,  Proposition  1  implies  that 

y-yu  -*•  y-y  Strongly  in  C'([0,T’];  H1/2~e(fl))  as  0  -*•  0  and 

Vyu(T,  x)  —  y7(T,  x)  _  £  ^  U'fu(t,  x)  —  y7(t,  x)  _  V0  >  0 
0  T  9 

Employing  these  results  and  the  classical  mean  value  theorem,  we  come  up  with 
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+  J  J  —(t,x,u7  +  6z0u)9udtd(ix 

+  J  (||w7  +  9u-  u||^2(2r)  +  ■  -  -  +  ||«7  —  w|li2(2r))( J  0u(2u7  -2 u  +  6u)dax)dt 

+  7  ffQ(ai(yr^  +  aM)ai(y-r  +  ei{Vin  -  V^iVyu  -  y1)dtdx\ 

where  9i  =  6ft,  x )  G  [0,1]  a.e.  in  Q  for  i  =  1,2, 3, 4.  Observe  that  9i(yiu  - 
yf)  — >  0  strongly  in  L2(Q)  as  0  — >  0  for  i  =  1,2, 3, 4  and  that  ay(yyu)  +  a7(y7)  G 
Z,2(0,T;  L2(Q)).  Then  by  using  assumptions  (i)-(iii)  and  the  Lebesgue  dominated 
convergence  theorem,  we  obtain 


(3.11) 


0  <  J^ip'(y1(T,x))£Tudx  +  J J^(^(t,x,y7)  +  2ja,1(y-y)a1(yy))£udtdx 
dh , 


+  J  J  —(t,x,  u^)udtdax  +  2p  J  ||u7  —  J  {u7  -  u)udcrx)dt. 

The  latter  implies  (3.10)  and  ends  the  proof  of  the  theorem.  □ 


4.  Necessary  Optimality  Conditions  with  State  Constraints 

In  the  last  part  of  this  paper  we  develop  a  limiting  procedure  to  derive  neces¬ 
sary  optimality  conditions  for  the  original  Dirichlet  boundary  control  problem  (P) 
with  pointwise  state  constraints.  This  procedure  is  based  on  passing  to  the  limit  in 
necessary  optimality  conditions  for  the  approximating  problems  (P7)  by  taking  into 
account  the  strong  convergence  results  established  in  Section  3.  Analyzing  these  nec¬ 
essary  optimality  conditions  (Theorem  6),  we  can  observe  that  to  pass  to  the  limit 
therein  one  needs  to  get  a  uniform  bound  for  the  perturbation  term  7 q7(-)q:7(-)  in 
an  appropriate  space.  Such  a  bound  does  not  follow  from  the  previous  consideration 
without  additional  assumptions.  To  furnish  this  let  us  impose  a  constraint  qualifica¬ 
tion  condition  (CQ)  for  the  state  constraints  in  problem  (P).  In  what  follows  ]|  •  ||oo 
and  ||  ■  ||i  denote  the  norms  in  L°°(Q )  and  Ll{ Q ),  respectively. 

(CQ)  There  exist  u  G  Uad  and  rj  >  0  such  that  for  all  £  G  L°°(Q)  with  ||C||oo  <  1 
the  mild  solution  y  of  ( 2.1)  corresponding  to  u  satisfies  the  condition 

a  <  y(t,x)  +  rj((t,x)  <  b  a.e.  in  Q. 

Observe  that  this  qualification  condition  is  different  from  the  classical  Slater  in- 
teriority  one  in  the  corresponding  space;  compare,  e.g.,  [13],  In  particular,  (CQ) 
does  not  imply  that  the  set  of  feasible  trajectories  y  has  nonempty  interior  in  the 
space  C([0,T];/f1/2_£(fl)).  We  refer  the  reader  to  [3]  and  [4]  for  more  discussions  on 
the  related  qualification  conditions  for  the  case  of  parabolic  systems  with  distributed 
controls. 

The  next  lemma  provides  the  desired  uniform  estimate  that  turns  out  to  be  crucial 
in  our  limiting  procedure. 

Proposition  7.  In  addition  to  the  assumptions  made  above  we  impose  the  qualification 
condition  (CQ).  Then  there  exists  a  constant  C  >  0  such  that 

(4.1)  ||7a7(y7)a7(y7)||1  <  C  V7  >  0. 
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Proof.  Given  u  in  (CQ),  let  us  substitute  u  =  u  -  u7  into  (3.11).  Employing  the 
monotonicity  of  a7(-):  one  has 

0<  J  ip'(y7(T,x))£T(u-u7)dx  +  J J^^(t,x,y7)C(u  -  u7)dtdx 
+  J f^^(t,x,u7)(u  -  u7)dtdax 

+  2p  J  ||it7(t)  -  w(<)|lx,2(r )(j^,(u7  -  w)(m  —  u7)dax)dt 

+  2  JJQ^(y7)a7(y7)(£u  -  Cu7)dtdx 

<  f^ip\y7(T,x)){y(T,x)  -  y7{T,x))dx  +  J j  ^(t,x,y7)(y  -  yjdtdx 
+  J J  ^(t,x,u7)(u  -  u7)dtdcrx 

T  2 p  J  ||u7(f)  u(£)  [[l2(f) (^7  u)(u  u7)dax)dt 

-  2  //  -  a7(y  +  f?C))(2/7  ~V~  vC)dtdx 

-2*7  J jQloi'1{y1)a7{y7)C,dtdx 

<  Jn  <p'(y7(T,  x))(y(T,  x)  -  t/7(T,  x))dx  +  J ~{t,  x,  y7)(y  -  y7)dtdx 

+  JJ  ^(t,x,u7)(u  -  u7)dtdax 

+  2pJq  IKW  -  u(t)\\pL^r){J^u7  -  u)(u  -  u7)dax)dt 

-  j J^ja\{yi)a7(y7)Cdtdx  V<  G  L°°(Q )  with  ||C||oo  <  1- 

Now  taking  into  account  Theorem  5,  we  can  find  a  constant  C  >  0  independent  of  7 
such  that 

J j  ^(y7)a7(y7)Cdtdx  <  C  V7  >  0  VC  G  L°°(Q )  with  ||C||oc  <  1. 


This  estimate  yields  (4.1)  and  ends  the  proof.  □ 

Let  us  denote  by  ba(Q)  the  space  of  bounded  additive  functions  (generalized  mea¬ 
sures)  on  subsets  of  Q  that  vanish  on  sets  of  the  Lebesgue  measure  zero.  It  is  well 
known  that  this  space  can  be  identified  with  the  dual  space  to  L°°(Q)  in  the  following 
sense:  for  each  A  e  (L°°(Q))*  there  is  a  unique  A  G  ba(Q)  such  that 


A (w)  =  JJ  w\{dtdx )  Vw  G  LTO(Q). 


In  the  sequel  we  do  not  distinguish  between  A  G  (L°°(Q))*  and  its  counterpart 
A  G  ba(Q).  Recall  that  (supp  A)  means  the  support  set  for  A  G  (L°°(Q))*  where  this 
measure  is  not  zero.  In  what  follows  the  convergence  along  a  generalized  sequence 


B.S.  Mordukhovich  and  K.  Zhang 


233 


means  the  convergence  of  a  net  in  the  weak*  topology  of  the  space  ( L°°(Q ))*  where 
the  topological  and  sequential  limits  are  different. 

For  the  optimal  trajectory  y(t,  x)  to  problem  (P)  we  define  the  set 

Qab  :=  {(t,x)eQ  |  y(t,  x)  =  a  or  y(t,  x )  =  b} 

where  the  state  constraints  (2.2)  are  active.  This  set  plays  an  essential  role  in  the 
results  below. 


Proposition  8.  Under  the  assumptions  made  in  Proposition  7  there  exist  A  £  (L°°(Q))* 
with  supp  A  C  Qab  and  a  generalized  sequence  of  {7}  along  which 


2^a'1(yQa1{yQ)  — >  A  weakly*  in  (L°°(<2))*  as  7  — >  0. 

Proof.  We  just  sketch  the  proof  referring  the  reader  to  [16]  for  more  details  in  a 
similar  setting.  Let  us  define 


A 7(w)  :=  2  J  J  'ya'J(y1)a1(yQwdtdx  Vw  6  L°°(Q) 


for  each  7  >  0.  Proposition  7  ensures  the  uniform  boundedness  of  {A7}  in  ( L°°(Q ))*. 
Due  to  weak*  compactness  of  the  unit  ball  in  a  dual  space  we  find  A  £  ( L°°(Q ))*  and 
a  generalized  sequence  of  {7}  along  which 


(4.2)  hm  A7(w)  =  lim  2  JJ  'ya'^(y~/)a7(yQwdtdx  =  A(w)  Vw  £  L°°(Q). 


It  remains  to  show  that  supp  A  C  Qab-  To  this  end  we  observe  that 


meas({(t,a;)  £  Q  \  y(t,x)  <  a  or  y(t,x )  >  b })  =  0. 

Thus  assuming  that  supp  A  (/L  Qab,  one  has  a  set  Q  with  the  properties 

(4.3)  meas(Q)  >  0,  A (Q)  ^  0,  and  Q  C  {(t,  x)  £  Q  \  a  <  y(t,x )  <  b}. 

Now  arguing  in  the  same  way  as  in  [16],  we  find  a  nonnegative  function  c(p)  such 
that  c{p)  — >  0  when  p  — >  0  and 

|A(m)|  <  c(p)  Vw  £  L°°(Q),  supp  w  c  Q 

for  all  p  sufficiently  small.  This  yields 

A(w)  =  0  Vw  £  L°°(Q),  supp  w  c  Q 


which  contradicts  (4.3)  and  ends  the  proof  of  the  proposition.  □ 

Now  we  are  ready  to  derive  necessary  optimality  conditions  for  optimal  solutions 
to  (P)  by  passing  to  the  limit  in  (3.10).  Due  to  the  weak*  convergence  result  of 
Proposition  8  we  need  to  show  that  the  operator  C  defined  by  (2.5)  is  continuous  from 
L°°(I1)  into  L°°(Q)  (note  that  this  is  different  from  Proposition  1).  The  next  theorem 
establishes  this  property  and  provides  the  desired  necessary  optimality  conditions  for 
the  original  state-constrained  problem  (P). 
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Theorem  9.  Let  ( u ,  y )  is  an  optimal  solution  to  problem  (P)  under  all  the  assumptions 
made  above.  Then  there  is  a  measure  A  €  (L°°(Q))*  with  supp  A  C  Qab  such  that 


0  <  j fj{C*Tip'{y))(t,x)  +  +  ^{t,x,u)\(u  -  u)dtdax 


(4.4) 


+ 


JJ  (u  —  u)(C*  \)(dtdox)  G  Uad. 


Proof.  Let  {(u7,y7)}  be  a  sequence  of  optimal  solutions  to  problems  (P7)  that 
strongly  converges  to  ( u ,  y )  due  to  Theorem  5  satisfying  the  necessary  optimality 
conditions  in  Theorem  6.  It  follows  from  (3.10)  that 


0  <  J Jj(jC*T(p'{y7))(t,x)  +  C*(^{t,x,y7)  +  2ja'7(y7)a7(y7)) 
dh 

(4-5)  +  —  (t,x,Uy)](u  -  u7)dtderx 

■p  2 p  J  ||u7  ^ll z,2(r) ( ^ (^7  vf)(u  u7)dox)dt  'tfu  G  Uad. 

Our  purpose  is  to  pass  to  the  limit  in  (4.5)  as  7  -*•  0  along  a  generalized  subsequence. 
Due  to  Proposition  1,  Theorem  5,  and  the  well-known  continuity  of  the  operator 
£*  :  L2(0,  T;  L2(fi))  -f  £2(0,  T;  £2(r))  (see,  e.g.,  [11])  we  have  y/(y7(T,  •))  G  L\Q)  C 
j^-i/2+e(^)  for  all  7  >  0  and 

J Jj(CT<p'{y7))(t,x)  +  C*{^(t,x,y7))  +  ^(t,x,u7)\(u  -  u7)dtdax  -* 
JJ^[{jC*T<p'(y))(t,x)  +  £*(|^(i,  x,  y))  +  x, «)](«  -  u)dtdax  Vu  G  Uad. 
Since  the  last  term  in  (4.5)  converges  to  0,  it  remains  to  show  that 

(4.6)  //(«  -  u7)C*(2'ya'7(y7)a7(y7))dtdox  ->  J  jju  -  u)(C* \){dtdax) 

as  7  — >  0  for  any  u  G  Uad-  Due  to  Proposition  8  property  (4.6)  immediately  follows 
from  the  weak*  continuity  of  the  operator  C*  :  {L°°(Q))*  — >  (L°°(E))*.  In  turn,  this 
weak*  continuity  of  the  adjoint  operator  is  a  direct  consequence  of  the  strong  conti¬ 
nuity  of  the  operator  £  in  (2.5)  considered  from  L°°(E)  into  L°°(Q).  To  justify  the 
latter  property  we  follow  [16]  and  invoke  some  results  from  the  theory  of  generalized 
solutions  to  parabolic  equations. 

Let  v  G  L2(E)  be  a  boundary  condition  in  (2.1).  According  to  [12,  Theorem  9.1], 
there  is  a  unique  y(v)  G  L2(Q),  called  a  generalized  solution  to  (2.1),  such  that 

(4.7)  ffQy(v)(-^  +  A’z)dtd*  = 

Vz  G  {z  G  H2,1(Q)  I  z(t,x)  =  0,  (t,x)  G  E,  z(T,x)  =  0} 

where  vA  is  an  outer  normal  to  T  associated  with  the  operator  A. 

Let  v  G  L°°(E)  and  let  y  =  Cv  be  the  corresponding  mild  solution  to  system  (2.1). 
We  are  going  to  show  that  such  y  coincides  with  the  generalized  solution  to  (2.1)  in 
the  sense  of  (4.7).  Since  L°°( E)  C  7/(0,  T;  L2(r)),  we  may  consider  v  as  an  element 
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of  LP(Q,  T;  I/2(r))  and  use  the  fact  that  D(E),  the  space  of  C°°  functions  on  E  with 
compact  supports,  is  dense  in  1/(0, T-,  L2(T)).  This  gives  a  sequence  {vn}  C  T>( E) 
such  that 

vn  —*  v  strongly  in  Lp( 0,  T;  L2(T))  as  n  — >  oo. 

Since  for  each  vn  G  D(E)  system  (2.1)  has  a  unique  classical  solution  yn,  we 
automatically  get  that  yn  =  Cvn  and  it  satisfies  (4.7).  Moreover,  it  follows  from 
Proposition  1  with  e  =  1/2  that 

||£v  —  Vn  ||c([0,T];L2(n))  =  ||£«  ~  £Wn||c([0,T];L2(n))  ~ >  0  as  n  >  OO. 

Taking  into  account  all  these  facts,  we  have 

|  [[  Cv(~  +  A*z)dtdx  +  [[  vp-dtdax\ 

J  Jq  at  J  J e  ova 

- 1  JUcv  -  + A'*)dtdxl + 1  JUv  -  v")^tdtdir‘i 

dz 

<  || Cv  -  J/n||c([0,Tl;£2(n))||  -  +  A*z\\L2(0'T.L2(n-))T1/'2 

d  z  - 

+  ||w  —  ^n||LP(0,T;L2(r))||^^-||i2(0,T;L2(r))7’1'/^  “^0  as  71  — >  OO 

C\  /  -j  \ 

where  q  := - — • -.  Thus  we  obtain 

P- 2 

///”<- 1 +  =  -  J L’,-t;d,d'’‘ 

Vz  €  {z  G  H2,1(Q)  |  z(f , a;)  =  0,  (t, a:)  G  E,  x(T,  a:)  =  0}. 

The  latter  means  that  the  mild  solution  y  =  is  also  a  generalized  solution  to  (2.1) 
for  any  v  G  L°°(E).  Using  the  uniqueness  of  generalized  solutions  and  the  fact  That 
the  generalized  solution  operator  is  a  continuous  map  from  L°°(E)  into  L°°(Q)  (see, 
e.g.,  [12,  pp.  205-206]),  we  conclude  that  the  linear  operator  C  is  continuous  from 
L°°( E)  into  L°°(Q).  This  completes  the  proof  of  the  theorem.  □ 
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Nonlinear  Heat  Transfer  Laws 
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Abstract.  Consider  the  heat  equation  with  a  nonlinear  function  a  in  the  boundary  con¬ 
dition  which  depends  only  on  the  solution  u  of  the  initial-boundary  value  problem.  The 
unknown  function  a  belongs  to  a  set  of  admissible  functions.  For  this  problem  the  existence 
of  a  second  Frechet  derivative  of  the  control-state  mapping  is  proved.  Based  on  this  result 
a  necessary  second  order  optimality  condition  is  formulated.  For  the  investigated  objective 
sufficient  second  order  condition  are  closely  connected  with  stability  estimates.  Using  the 
knowledge  about  stability  estimates,  it  is  shown  that  already  for  simple  cases  the  usual 
sufficient  conditions  can  not  be  fulfilled. 
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1.  Introduction 

This  paper  deals  with  the  connection  between  second  order  optimality  conditions 
and  stability  estimates  for  the  identification  of  nonlinear  heat  transfer  laws.  The  iden¬ 
tification  problem  is  formulated  as  an  optimal  control  problem  where  the  unknown 
heat  exchange  function  plays  the  part  of  the  control.  The  unknown  function  a  is 
assumed  as  a  function  of  the  temperature  u.  The  control  system  under  consideration 
is  governed  by  a  semilinear  parabolic  equation,  hence  the  control  problem  belongs 
to  the  class  of  nonconvex  optimization  problems.  In  contrast  to  parabolic  control 
problems  with  convex  objective  functionals  and  linear  equations,  where  the  list  of 
references  and  optimality  conditions  is  very  extensive,  only  a  few  investigations  have 
been  devoted  to  the  case  of  nonlinear  parabolic  equations.  In  nonconvex  problems, 
sufficient  second  order  optimality  conditions  at  the  optimal  point  are  a  substitute  for 
the  convexity.  The  theory  of  sufficient  second  order  conditions  for  twice  differentiable 
extremal  problems  in  function  spaces  is  known  to  be  more  rich  and  interesting  than 
that  for  problems  in  finite-dimensional  spaces.  This  is  due  to  the  so-called  two-norm 
discrepancy  expressing  the  noncompatibility  of  the  norms  needed  for  second  order 
optimality  condition.  This  difficulty  was  resolved  successfully  by  Ioffe  [6]  and  Maurer 
[7] .  The  theory  was  extended  on  a  class  of  parabolic  boundary  control  problems  in  the 
papers  of  Goldberg  and  Troltzsch  [4], [5].  But  the  investigating  type  of  identification 
problems  has  an  other  complicate  structure.  Therefore  it  is  one  goal  of  this  paper  to 
show  that  it  is  impossible  to  get  similar  conditions  for  the  identification  problem. 
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To  ensure  a  sufficient  optimality  condition  it  is  often  required  that  there  exist 
terms  in  the  objective  which  depend  directly  on  the  control.  For  that  reason  the 
usual  way  is  using  a  Tikhonov-regularization.  But  the  Tikhonov-regularization  is 
also  the  most  popular  way  to  get  well-posedness  and  stability  of  inverse  problems,  see 
Tikhonov/ Arsenin  [13],  Tikhonov/Goncharskij/Stepanov/Yagola  [14].  Hence  there  is 
a  close  connection  between  sufficient  second  order  condition  and  stability  in  this  well- 
investigated  class  of  boundary  control  problems.  But  this  approach  has  disadvantages 
for  the  application  to  the  nonlinear  identification  problem.  A  natural  way  to  handle 
the  nonlinear  identification  problem  consists  in  using  a  compact  set  of  admissible 
controls  a  (that  means  “admissible  laws”).  For  practical  applications  it  is  often  easy 
to  bound  the  maximal  growth  of  the  heat  exchange  coefficient  with  respect  to  the 
temperature.  The  compactness  of  the  set  of  admissible  controls  is  the  main  prop¬ 
erty  to  get  statements  about  stability  and  well-posedness  in  suitable  chosen  function 
spaces,  see  Rosch  [8].  In  this  approach  we  have  no  regularity  term.  By  the  way,  it  is 
difficult  to  find  a  proper  function  space  for  the  regularity  term  for  the  identification 
of  a  heat  exchange  coefficient  depending  only  on  the  boundary  temperature.  These 
facts  generate  an  own  specific  of  identification  problems.  The  compactness  of  the 
admissible  control  set  relieves  the  derivation  of  several  properties.  Otherwise,  with 
the  absence  of  the  Tikhonov-term  is  also  absent  the  natural  quadratic  convexity  term 
of  the  objective  with  respect  to  the  control.  Another  difference  is  caused  in  the  non¬ 
standard  structure  of  the  control-state-mapping.  Usually  the  optimality  conditions 
need  no  derivatives  of  the  control,  but  in  our  case  every  differentiation  of  the  ob¬ 
jective  uses  derivatives  of  the  control  a.  For  that  reason  it  is  necessary  to  require 
C2-regularity  of  the  control  in  order  to  get  second  order  conditions. 

In  this  paper  we  want  to  discuss  second  order  optimality  conditions  for  the  iden¬ 
tification  problem  and  the  connection  to  stability  estimates.  The  optimal  control 
problem  we  are  going  to  investigate  is  to  minimize  the  functional 

T 

(1.1)  ’F(a)  =  J  j(u(t,x)  -  q{t,x)fdSxdt, 

o  r 

subject  to 

chi 

-Qj.  (t,x)  =  Axu(t,  x)  on  (0,T]  x  fi 

(1.2)  u(0,a:)  =  u°(r)  on  fi 

—  (t,x)  =  a(u(t,x))(d  —u(t,x))  on  (0,T]  x  T 
an 

where  the  control  a  is  taken  from  the  set 

Uad  ■=  {ot  6  C2\di,  i?2],  o  <  TOi  <  a(u)  <  Ml,  m2  <  a'(u)  <  M2, 
m3  <  a"(u)  <  M3} 

In  this  setting,  fl  C  Rm  is  a  bounded  domain  with  C'°°-boundary  r,  T  >  0  a  fixed 
time,  ■&  a  fixed  temperature  and  q  €  L2((0,T)  x  T)  is  a  given  function  of  “measure- 
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ments” .  0X  and  02  are  defined  by 

min(0,  inf  u°(x)) 

x€$l 

max(0,  sup  u°(x)). 
xeQ 

Interpreting  the  process  as  a  heating  problem,  the  variable  u  means  the  temperature 
of  the  material,  u°  the  initial  temperature,  0  the  constant  temperature  of  the  sur¬ 
rounding  medium,  and  a  the  unknown  heat  transfer  function,  playing  the  part  of  the 
control. 


= 

02  = 


2.  Preliminary  results 

In  this  section  we  introduce  some  notations  and  recall  known  results  on  the  be¬ 
haviour  of  the  parabolic  system  (1.2),  which  belongs  to  the  class  of  semilinear  prob¬ 
lems.  For  convenience  we  shall  apply  the  theory  of  analytic  semigroups.  We  shall 
heavily  rely  on  results  by  Amann  [1],[2]  for  semilinear  parabolic  problems.  The  as¬ 
sumptions  and  preparations  we  shall  need  here  are  nearly  the  same  as  in  Rosch/ 
Troltzsch  [12],  where  well-posedness  of  the  parabolic  system  is  proved. 

In  all  what  follows  we  work  in  Sobolev-Slobodeckij  spaces  Wpa{ fl)  and  Wpa(fl) 
with 

—  <  2cr  <  2d  <  1  + 

V  P 

Note  that  this  inequality  ensures  the  continuity  of  the  regarded  functions.  The  solu¬ 
tion  of  the  heat  equation  u  is  looked  upon  in  the  Banach  space 
£?([(), T],  WpCT(Q))  provided  that  the  initial  value  u°  belongs  to  W%*(Q). 

Let  A  be  a  linear,  positive,  and  elliptic  differential  operator.  Then  the  parabolic 
equation 


u(0)  =  u° 

subject  to  homogeneous  boundary  conditions  gives  rise  to  an  analytic  semigroup  of 
linear  continuous  operators  denoted  by  S(t). 

Following  [12]  we  define  A:  Lp{ f2)  D  D(A )  — *  Lp{£l)  by 
D(A)  =  {w  e  Wp(Q)  :  dw/dn  |r=  0},  Aw  =  (—A  +  I)w  for  w  G  D(A). 

Then  the  initial  value  problem 


u'(t)  =  —Au(t) 
u(0)  =  u° 

has  the  unique  solution  u(t)  =  S(t)u°.  The  semigroup  S(t)  generated  by  -A,  S(t)  = 
“exp (—At)"  is  an  analytic  semigroup  of  linear  continuous  operators  in  Lp( Q). 

For  solving  an  initial  value  problem  with  an  inhomogeneous  boundary  condition  a 
special  solution  of  the  corresponding  elliptic  boundary  value  problem  is  needed.  Let 
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g  e  LP(T).  The  mapping,  which  assigns  to  g  the  solution  v  of  the  elliptic  boundary 
problem 

Av  —  v  =  0 

(2.1)  dv 

=  9, 

an 

is  denoted  by  N,  i.e.  v  =  Ng.  Transforming  the  heat  equation  with  u  =  we 4,  we 
obtain 

dw 

A w(t,x)  —  w(t,x)  —  —(t,x)  on  (0,T]  xfl 

(2.2)  iu(0,  x )  =  u°(x)  on  fl 

^-(t,x)  =  a(w(t, x)et)(fl  —  w(t, x)et)e~t  on  (0,T]  x  T. 
on 

Now  the  semigroup  approach  can  be  applied.  The  operator  —A  is  known  to  generate 
a  strongly  continuous  and  analytic  semigroup  (S(t),  t  >  0}  of  linear  continuous 
operators  in  LP(Q),  see  Friedman  [3].  TV  is  a  continuous  mapping  from  LP(T)  to 
kFp(Q)  for  all  s  <  1  +  1/p,  cf.  Triebel  [15]. 

Regarding  the  function  w  as  an  abstract  function  w  =  w(t)  with  values  in  the 
Banach  space  Wpa(Q)  the  nonlinear  Bochner  integral  equation 

t 

(2.3)  w(t)  =  J  AS{t  -  s)NB{tw{s))  ds  +  S(t)u° 

o 

is  obtained.  We  refer  to  Amann  [1],[2].  In  this  equation,  r  denotes  the  trace  operator 
and  B  is  the  Nemytskij  operator  defined  by 

(2.4)  B(v)(t,x)  =  a{v(x)et){'d  -  v(x)et)e~t,  v  G  C'(T). 

Here  the  trace  operator  maps  Wpa(Q)  into  C(T).  Inserting  the  backward  transfor¬ 
mation  w  =  we  get  in  turn 

t 

(2.5)  u(t)  —  J  AS(t  —  s)iVe(<~s)a(Tu(s))($  -  ru(s))ds  +  etS(t)u°. 

o 

For  each  a  G  Uad  we  get  a  unique  solution  u  G  C0,l5([0,  T],  W£a(Q,)).  This  solution 
satisfies  the  maximum  principle 

(2.6)  •d i  <  u(t,x )  <  i?2- 

Let  us  slightly  simplify  the  notation.  Denoting  the  kernel  of  the  integral  in  (2.5)  by 
k(t  —  s)  :=  AS(t  —  s)Ne^~s\  we  get 

t 

(2.7)  u(t)  =  J  k(t  -  s)a(ru(s))(i9  -  ru(s))ds  +  etS(t)u°. 

o 

The  right  hand  side  of  the  Bochner  integral  equation  (2.7)  depends  only  on  the 
boundary  values  of  u.  For  that  reason,  we  shall  investigate  this  equation  only  on 
the  boundary.  Therefore,  it  is  convenient  to  introduce  the  trace  of  u  by  x  =  tu 
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with  t  :  C0,l5([ 0,T],Wpa (&))  i— >  C([0,T]  x  T)  and  to  consider  the  boundary  integral 
equation 

t 

(2.8)  x(t)  =t  J  k{t  -  s)a{x(s)){rd  -  x{s))ds  +  TetS(t)u°. 

o 

Next  we  define  a  mapping  $  by 

(2.9)  {$a)(t)  =  x(t), 

where  x(t)  is  the  solution  of  the  boundary  integral  equation  (2.8).  It  is  also  possi¬ 
ble  to  work  with  weak  solutions  under  reasonable  assumptions,  but  in  this  case  the 
derivation  of  several  results  is  much  more  complicated.  For  the  reader  who  prefers 
to  work  with  other  techniques  we  will  write  all  equations  in  the  form  of  PDE.  Never¬ 
theless,  the  solutions  are  assumed  as  solutions  of  the  corresponding  Bochner  integral 
equation. 

In  Rosch  [9]  it  is  proved  that  $  is  Frechet  differentiable  from  Cl[D i,$2]  to  C([0,T]  x 
T)  at  a  point  a0.  Furthermore,  let  v  be  the  solution  of  the  initial  boundary  value 


problem 

dv.  . 

=  Axv(t,  x)  on  (0,T]  x  Q 

(2.10) 

v(0,x) 

=  Oonfl 

dv,  , 

=  (a'0(u0(t,  x))(tf  -  u0{t,  x ))  -  a0{u0{t,  x)))v 

+a(uo(t,x))('d  —  uo(t,x))  on  (0,T]  x  T. 

Then  the  Frechet  derivative  <fr'(a0)a  is  the  trace  of  the  solution  v  on  the  boundary  T. 

We  expose  the  dependence  of  v  with  respect  to  the  direction  and  write  for  instance  vp 
which  means  v  is  the  solution  of  (2.9)  for  a  =  (3.  This  notation  is  useful  for  derivation 
of  the  second  derivative  which  follows  in  the  next  section. 

3.  Second  Frechet  derivative 

In  this  section  we  want  to  prove  that  the  mapping  $  has  a  second  Frechet  derivative 
which  is  the  base  of  the  second  order  optimality  conditions. 

Theorem  3.1  (Existence  of  a  second  Frechet  derivative).  <P  has  a  second  Frechet 
derivative  as  mapping  from  C2^,^]  to  C([0,  T]  x  F)  in  every  point  ao  G  Uad- 
Moreover,  let  w  be  the  solution  of 

duu 

-^-(t,x)  =  A Xw(t,x)  on  (0,  f/xfi 

(3.1)  w( 0,  x)  =  0  on  fl 
d'w 

=  (ao(u0(t,2;))(tf -u{t,x))  -  2a'o(uo(t,x)))v0(t,x)v1(t,x) 
on 

+(a'0(u0(t, x))(fi  -  u(t,  x ))  -  a0(u0(t,  x)))w{t,  x) 

+(i(uo(t,x)){fi  -  u0{t,x))  -  'y(u0(t,x)))vp(t,x) 

+(/3'(u0(t,  x)){d  -  u0(t ,  x ))  -  f3(u0(t,  z)))u7(f,  x)on  (0,T]  x  F. 
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In  this  setting  uo  is  the  solution  of  (1.2)  for  a  =  ao-  Then  the  trace  of  the  solution 
w  is  the  second  Frechet  derivative  of  $  at  the  point  cto  in  the  directions  (3  and  7,  that 
means  $"(a0)[/?,  7]  =  tw. 

Proof:  To  prove  this  theorem  we  investigate  two  admissible  controls  a0  and  ai  = 
on  +  6  ‘  7  with  corresponding  solution  uo  and  u\  of  (1.2)  and  corresponding  first 
Frechet  derivative  v0  and  vx  in  direction  (3  (see  (2.10)).  Without  lost  of  generality 
let  ||7||  =  1.  Then  we  have  ||qi  -  a0||  =  e.  The  goal  of  the  next  steps  is  to  show 
that  the  remainder  term  has  the  property  v\—Vo  -e-w  =  o(e).  Now  we  discuss  the 
difference  6v  =  v  1  —  v0  which  solves  the  parabolic  problem 


dSv 

=  A xSv(t,x)  on  (0,Tj  x  Q 

(3.2) 

6u(0,  x ) 

=  0  on  Q 

dSv 

aT'1' 

=  (a^u^x))^  -  Ui(t,x))  -  a-i(ui(t,x)))vi 

(3.3) 

-(c4(u0(t,x))(i?  -  u0(t,x))  -  a0(u0(t,x)))v 0 
+(3{ux{t,x))(d  -  ui(t,x)) 

(3.4) 

-/3(u0(t,x))(i9  -  u0(t,x))  on  (0,T]  x  T. 

It  is  easy  to  see  that  only  the  boundary  condition  is  interesting.  For  that  reason  we 
discuss  only  this  part  in  the  next.  To  shorten  the  notation  we  drop  the  arguments  t 
and  x.  First  we  discuss  term  (3.4): 

f3(ui){’d-ui)-p(uo){'0-uo)  =  (3(uf)(d  uf]  (3{ti\)(d  Uq) 

+ /?(«i)(r?  -  u0)  - /3(u0)(i?  -  uo) 

(3'5)  =  -/3(u1)(u1-uo)+(/3(u1)-/3(uo))('i9-u0) 

=  (-/?(u0)  +  /3'(u0)(i9  -  u0))uy£  +  o(e) 

Now  we  discuss  term  (3.3): 

(ai(ui)($  -  Ui )  -  ai(wi))u!  -  (ao(uo)0?  -  u0)  -  a0(u0))uo 

(3.6)  =  {a\ (ui)($  -  ui)  -  ai(ui))vi  -  (a'^Ui)^  -  u0)  -  ai(uo))ux 

(3.7)  +(a[(u1)('d  -  u0))vi  -  (a'^uo)^  -  u0))v  1 

(3.8)  +(a'1(uQ)(i?  -  u0)  -  ai(u0))ui  -  (a'0(u0)(tf  -  u0)  -  a0{u0))vi 

(3.9)  +(ao(u0)(r?  -  u0)  -  a0(u0))ui  -  (a^uo)^  -  u0)  -  a0(u0))u0 
Next  we  handle  the  term  (3.6)-(3.9).  Term  (3.8)  is  easy: 

(a) (u0) (d  -  u0)  -  ai(u0))ui  -  (ao(uo)(i?  -  u0)  -  ao(u0))ui 

(3.10)  =  e(Y(uo)(i9  -  u0)  -  7(uo))ui 

=  e(7,(u0)('d  -  u0)  -  l{uo))v0  +  o(e) 

For  term  (3.7)  we  get 

(ai(ui)($  -  u0))u!  -  (a'^uoXtf  -  u0))u!  =  (a'j(ui)  -  ai(u0))(i?  -  u0)v  1 

(3.11)  =  a,1,(u0)('d  -  u0)u7ui£  +  o(e) 

=  ao(uo)(r?  -  u0)v7vp£  +  o(s) 
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Similar  we  deal  with  term  (3.6) 

(a[ -  ui)  -  ai(ui))wi  -  -  u0)  -  ai(u0))r>i 

(3.12)  =  (-ai(ui)(ui  -  uq)  -  (ai(«i)  -  ai(uo)))vi 
=  —  2a,1(ui)?;7?;/3e  +  o(e) 

Now  we  discuss  the  term 

(3.13)  r  =  Vi  —  v0  —  e  •  w 

which  is  the  remainder  term.  This  remainder  solves  a  parabolic  problem.  Using  the 
equations  (3.5)  and  (3.10)-(3.12)  we  get: 

Or 

—  (t,x)  =  A Xr(t,x)  on  (0,T]  x  Q 

(3.14)  r(0,  x)  =  0  on  Q, 

Or 

=  (a'Q(uo(t,x)){‘d-Uo(t,x))-ao(uo(t,x)))r+o(e)  on  (0,T]xT. 

For  that  reason  r  =  o(e)  holds  and  $  has  a  second  Frechet  derivative.  □ 


4.  Second  order  conditions  and  stability  estimates 

The  two  times  Frechet  differentiability  of  <f>  is  the  key  point  to  formulate  second 
order  conditions.  First  of  all  we  recall  the  necessary  first  order  condition.  In  Rosch 
[10]  we  find  the  two  formulations 

T 

(4.1)  \P'(ao)[/?]  ~  2// v/}(t,x)(u0{t,x)  -  q(t,x))dSxdt  >  0 

o  r 


(4.2)  $'(a0)[/3]  =2  J  J  p(u0{t,x))(ti  -  u0(t,x))y0{t,x)dSxdt  >  0. 

o  r 

where  y0  is  the  solution  of  the  adjoint  system  for  a  =  a0>  u  =  uo 
Ov 

--^(t,x)  =  Axy(t,x)  on  (0,T]  x  Q 

(4.3)  y(T,x)  =  Oonfi 
Ov 

^(t,x)  =  (a'(u(t,x))(‘d-u(t,x))-a(u(t,x)))y 
+u(t,x )  —  q(t,  x)  on  (0,T]  x  T. 

Based  on  theorem  3.1  about  the  second  Frechet  derivative  it  is  easy  to  formulate 
second  order  optimality  condition. 
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Theorem  4.1  (Necessary  second  order  optimality  condition).  Let  a0  be  an  optimal 
control  with  associated  state  u0.  For  all  admissible  directions  (3  =  a  -  a0,  a  e  Uad 
with  #'(<*„)  [0  =  0  the  necessary  second  order  condition 

T 

(4.4)  ty"(a0)[f3,(3\=2  J  J  wpp{t,x){uQ{t,x)  -  q{t,x))  +  v0{t,x)2dSxdt  >  0 

o  r 


holds. 


In  this  setting  wpp  is  the  solution  of  (3.1)  with  and  7  =  /?.  The  proof  is  very  simple 
and  for  that  reason  we  resign  to  sketch  it  here. 

Proposition:  It  makes  no  sense  to  formulate  sufficient  second  order  conditions.  For 
very  simple  cases  such  properties  can  not  hold. 

Let  us  now  discuss  sufficient  second  order  optimality  conditions.  Usually  sufficient 
optimality  conditions  require  one  or  both  of  the  conditions 

(4.5)  ®'(ao)[/3]  >  HPW 

(4.6)  *"(oo)[M  >  m\2 

for  all  admissible  direction  (3  and  a  positive  6.  First  the  norm  of  f3  is  the  norm  of  the 
differentiation  but  in  several  applications  it  can  be  weakened  with  a  other  norm. 

We  will  see  now  that  such  conditions  cannot  hold  for  the  simplest  case.  For  that 
purpose  we  choose  an  example  for  which  is  'k(ao)  =  0,  that  means  there  exists  an 
exact  solution,  thus  we  can  replace  the  “measurements”  q  by  the  corresponding  state 
u0.  Using  formula  (4.1)  we  get  easy  ^'(a0)[/?]  =  0  for  all  directions  (3.  For  that  reason 
a  condition  of  type  (4.5)  cannot  work  here. 

We  assume  now  that  a  condition  of  type  (4.6)  holds  which  a  suitable  norm. 


(4.7)  *"(a0W(3\>6m\2 

We  investigate  an  admissible  control  a  =  a0  +  (3  with  corresponding  state  u.  Using 
the  Taylor  expansion  we  have 


(4.8) 


*  (a)  =  *  (a„)  +  *'(ao)[0  +  ^>o)M  +  o{\\(3\\h) 
=  l*"(a0)[f3,f3\  +  om2c>) 


If  we  have  in  (4.6)  the  C2-norm,  then  we  get 
(4.9)  d'M  >  smb 


for  a  positive  6.  Using  the  definition  of  IF 


(4.10) 


u  ~  Mo||i2([o,T]xr)  —  mb 


holds.  With  a  new  6  >  0  we  get 


(4.11)  ||u  -  uo||z,2([o,r]xr)  >  <5||a  -  aollc2 

We  can  interpret  inequality  (4.11)  as  a  stability  estimate.  For  the  simple  case  Q  = 
[0, 1]  we  find  several  stability  estimates  in  Rosch  [11].  An  essential  point  in  this  paper 
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is  the  discussion  under  which  assumptions  estimates  of  the  more  general  type 

(4.12)  ||u-tio||  >  c\\a-  a0||" 

can  be  hold.  First  of  all  the  norm  of  the  difference  of  the  controls  has  to  be  evaluated 
only  on  a  smaller  set  as  [#i,i?2]  the  so-called  reference  set.  Variations  outside  of  this 
set  have  no  influence  of  the  state.  This  fact  compiles  several  investigation  but  it  is 
not  a  crucial  point.  We  only  recall  here  the  definition  of  reference  sets. 

Definition:  Let  a  e  Uarj  and  u  the  corresponding  solution  of  (1.2). 

The  set  M  :=  {u  :  u  =  u(t ,  x),  t  G  [0,  T\  x  e  T}  is  called  reference  set  of  a. 

In  the  illustrative  example  and  the  concluding  remark  of  this  paper  we  can  see  that 
for  a  very  simple  example  we  can  not  expect  Lipschitz  type  estimates 

(4.13)  ||u-u0||  >c||a-a0|| 

with  the  same  norms  for  u  and  a  (for  instance  C  or  Lf). 

Inequality  (4.11)  requires  much  more  than  that  namely  a  Lipschitz  estimate  of  the 
i2-norm  of  u  with  respect  to  the  C2-norm  of  a.  May  be  it  is  possible  to  weak  the 
norm  on  the  right-hand  side  of  inequality  (4.11).  To  ensure  that  all  terms  of  the 
second  Frechet  derivative  make  sense  we  must  necessarily  require  that  the  increment 
a~aQ  belongs  to  C'0,1.  Thus  the  L2-norm  or  a  weaker  norm  for  a-ao  is  unimaginable. 

Summarizing  these  deliberations,  we  notice  that  already  for  simply  cases  neither 
the  sufficient  estimates  of  type  (4.5)  nor  sufficient  estimates  of  type  (4.6)  hold.  Nev¬ 
ertheless,  the  investigation  of  the  second  derivative  delivers  useful  results.  First  of 
all  we  have  the  necessary  second  order  condition.  Second  we  are  able  to  construct 
second  order  descent  algorithm.  In  this  context  second  order  means  using  second 
order  information  not  quadratical  convergence. 

In  this  context  it  seems  to  be  a  way  out  to  use  an  additional  Tikhonov-term.  But 
this  way  has  disadvantages.  Because  of  the  dependence  of  the  temperature  of  a  we 
have  to  discuss  what  means  ||a|[|2.  If  we  choose  the  norm  over  the  reference  set  (for 
instance  L2{M))  then  we  get  the  problem  that  every  a  generates  its  own  reference  set. 
For  that  reason  the  norm  is  not  differentiable.  The  use  of  ||o!(«)||L2((o,r)xr)  geneates 
additional  differentiation  term  because  of  the  dependence  from  a.  Possibilities  to 
overcome  this  problems  could  be  the  use  of  ||o;(q)||L2((o,T)xr)  or  an  iterative  Tikhonov 
regularization  ||a(Un)||r,2((o,T)xr)- 

The  investigations  of  this  identification  problems  shows  that  for  such  problems  it 
is  necessary  to  find  new  ways.  It  is  imaginable  that  the  function  of  the  sufficient 
second  order  condition  can  be  fulfilled  by  a  stability  estimate  of  the  type  (4.12).  In 
[11]  we  find  several  estimates  of  this  form  for  the  one-dimensional  case.  For  instance 
it  is  possible  to  get  the  inequality 

(4-14)  || u  -  u0 ||i2  >  c||a  -  ao||c(M) 

where  M  denotes  the  reference  set  of  a0.  By  means  of  such  properties  and  the 
knowledge  of  the  second  order  information  it  should  be  possible  to  construct  numer¬ 
ical  algorithms  and  to  prove  convergence  rates.  These  are  the  goals  of  the  future. 
The  investigation  of  this  papers  show  that  the  standard  way  can  not  be  gone  for  this 
type  of  identification  problems. 
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Abstract.  A  model-based  LQR  method  for  controlling  vibrations  in  cylindrical  shells  is 
presented.  Surface-mounted  piezoceramic  patches  are  employed  as  actuators  which  leads 
to  unbounded  control  input  operators.  Modified  Donnell-Mushtari  shell  equations  incorpo¬ 
rating  strong  or  Kelvin- Voigt  damping  are  used  to  model  the  system.  The  model  is  then 
abstractly  formulated  in  terms  of  sesquilinear  forms.  This  provides  a  framework  amenable 
for  proving  model  well-posedness  and  convergence  of  LQR  gains  using  analytic  semigroup 
results  combined  with  LQR  theory  for  unbounded  input  operators.  Finally,  numerical  ex¬ 
amples  demonstrating  the  effectiveness  of  the  method  are  presented. 
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1.  Introduction 

The  use  of  shell  models  to  describe  structural  dynamics  is  pervasive  in  applications 
ranging  from  noise  reduction  in  aircraft  to  flow  control  in  flexible  pipes.  While  general 
shell  equations  can  be  used  in  a  variety  of  geometries,  they  all  share  the  property  that 
component  displacements  are  coupled  due  to  the  geometry.  This  leads  to  significant 
challenges  when  developing  appropriate  models  and  approximation  techniques,  and 
constructing  effective  controllers. 

In  this  paper,  we  consider  cylindrical  shells  due  to  their  prevalence  in  applications. 
Control  is  provided  by  piezoceramic  patches  bonded  in  pairs  to  the  surface  of  the  shell. 
These  transducers  provide  significant  actuating  capabilities  due  to  the  piezoelectric 
effect  in  which  input  voltages  generate  strains  in  the  patches.  Utilization  of  the 
converse  piezoelectric  effect  (strains  produce  voltages)  also  allows  the  patches  to  be 
employed  as  sensors.  When  combined  with  their  light  weight,  space  efficiency  and 
reasonable  cost,  these  properties  make  the  patches  highly  effective  control  elements 
in  a  variety  of  applications.  Prom  a  mathematical  perspective,  the  use  of  surface- 
mounted  piezoceramic  patches  leads  to  unbounded  control  input  operators. 

*This  research  was  supported  in  part  by  the  National  Aeronautics  and  Space  Administration 
under  NASA  Contract  Number  NAS1-19480  while  RCS  was  a  visiting  scientist  at  the  Institute 
for  Computer  Applications  in  Science  and  Engineering  (ICASE),  NASA  Langley  Research  Center, 
Hampton,  VA  23681.  Additional  support  was  also  provided  in  part  under  NASA  grant  NAG- 1-1600. 
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Experimental  work  has  already  demonstrated  the  potential  for  success  when  em¬ 
ploying  the  patches  as  actuators  in  applications  involving  cylindrical  shells  [8,  13]. 
However,  these  initial  investigations  have  not,  in  general,  utilized  the  full  potential 
of  the  patches  due  to  limitations  in  hardware,  models,  approximation  techniques  and 
control  laws.  For  example,  a  common  means  of  approximating  shell  dynamics  to  yield 
systems  which  are  ultimately  used  to  calculate  control  gains  is  through  modal  expan¬ 
sions  [10].  However,  closed  form  expressions  for  the  modes  can  be  determined  only 
for  a  limited  set  of  models  with  severely  restrictive  boundary  conditions.  The  use  of 
incorrect  modes  when  calculating  control  gains  can  lead  to  loss  of  control  authority 
and  possible  controller  instabilities. 

In  this  paper,  we  present  a  model-based  method  for  controlling  shell  vibrations. 
For  simplicity,  the  Donnell-Mushtari  shells  equations  with  Kelvin- Voigt  damping  are 
used  as  a  model  (the  assumption  of  strong  or  Kelvin- Voigt  damping  is  reasonable  and 
typical  for  many  shell  materials  such  as  aluminum).  The  methods  are  general,  how¬ 
ever,  and  can  be  applied  to  higher-order  models  (e.g.,  Byrne-Fliigge-Lur’ye  model)  if 
the  application  warrants.  A  general  Galerkin  method  based  on  splines  is  then  used  to 
discretize  the  infinite  dimensional  system.  Through  the  choice  of  basis,  the  method  is 
significantly  more  flexible  than  general  modal  methods  when  considering  the  bound¬ 
ary  conditions  and  material  nonhomogeneities  which  arise  in  typical  applications. 

The  model  and  approximate  system  are  then  employed  in  an  LQR  full  state  feed¬ 
back  theory  to  obtain  feedback  gains  and,  ultimately,  controlling  voltages  to  the 
patches.  While  full  state  measurements  are  not  available  using  current  instrumen¬ 
tation,  and  hence  the  techniques  cannot  directly  be  implemented  in  experiments, 
they  provide  an  important  first  step  in  the  design  of  effective  compensators  based 
on  state  estimates  calculated  using  a  limited  number  of  observations  (see  [5]).  The 
consideration  of  the  LQR  performance  also  illustrates  properties  of  the  system  and 
model-based  control  techniques  and  facilitates  investigations  regarding  issues  such  as 
patch  number  and  configuration.  Finally,  the  consideration  of  the  problem  provides 
a  step  toward  the  development  of  model-based  controllers  for  fully  coupled  structural 
acoustic  and  fluid/structure  systems  involving  cylindrical  shells. 

The  strong  and  weak  forms  of  the  Donnell-Mushtari  shells  equations  are  outlined  in 
Section  2.  In  presenting  this  model,  care  is  taken  to  include  both  passive  (material) 
and  active  (actuator)  contributions  due  to  the  patches.  An  abstract  form  of  the 
model,  based  on  sesquilinear  forms,  is  also  presented.  This  provides  a  natural  setting 
to  prove  model  well-posedness  and  convergence  properties  of  the  LQR  control  law. 
LQR  full  state  feedback  laws  for  systems  with  no  exogenous  force  or  forces  which 
are  periodic  in  time  are  presented  in  Section  3.  In  the  former  case,  convergence  of 
the  approximate  suboptimal  gains  to  the  optimal  gains  for  the  infinite  dimensional 
system  is  proven  using  analytic  semigroup  theory  in  combination  with  LQR  results  for 
unbounded  control  input  operators.  A  Fourier-Galerkin  method  for  approximating 
the  system  dynamics  is  outlined  in  Section  4,  and  the  effectiveness  of  the  LQR  method 
for  periodic  forces  is  demonstrated  through  a  numerical  example  in  Section  5.  This 
example  demonstrates  that  through  the  use  of  the  model-based  methodology  with 
general  Galerkin  approximations,  significant  attenuation  in  shell  vibrations  can  be 
obtained  using  piezoceramic  patches. 
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2.  PDE  Model 

The  system  under  consideration  consists  of  a  thin  cylindrical  shell  with  surface- 
mounted  piezoceramic  patches.  It  is  assumed  that  the  patches  are  mounted  in  pairs 
with  edges  aligned  with  the  circumferential  and  longitudinal  axes  of  the  shell.  The 
edges  of  the  shell  are  taken  to  be  fixed  in  accordance  with  common  experimental 
clamping  techniques. 


Figure  1.  Thin  cylindrical  shell  with  surface  mounted  piezoceramic  patches. 


To  specify  the  geometry  for  the  corresponding  model,  we  consider  the  longitudinal 
direction  to  be  aligned  along  the  z-axis  as  depicted  in  Figure  1 .  The  displacements 
of  the  middle  surface  in  the  longitudinal,  circumferential  and  transverse  directions 
are  denoted  by  u,  v  and  w,  respectively  while  the  length,  thickness  and  radius  of  the 
shell  are  denoted  by  l,  h,  R.  The  region  occupied  by  the  middle  surface  is  denoted  by 
r0.  Finally,  the  shell  is  assumed  to  have  mass  density  p,  Young’s  modulus  E,  Poisson 
ratio  v,  Kelvin- Voigt  damping  coefficient  cD  and  air  damping  coefficient  p. 

Actuator  and/or  sensor  capabilities  are  provided  by  s  pairs  of  surface-mounted 
piezoceramic  patches.  It  is  assumed  that  all  the  patches  have  thickness  hpe,  Young’s 
modulus  Epe,  Poisson  ratio  i/pe  and  Kelvin- Voigt  damping  coefficient  cDpe.  Further¬ 
more,  it  is  assumed  that  the  glue  bonding  layer  provides  negligible  contribution  to  the 
structural  dynamics  (the  reader  is  referred  to  [3,  5]  for  details  concerning  the  incor¬ 
poration  of  differing  patch  characteristics  and  bonding  layers  in  the  ensuing  models). 
The  region  covered  by  the  ith  patch  pair,  with  edges  at  xu,  x2i,  On,  02i,  is  delineated 
by  the  characteristic  function 


Xpe;(M) 


1  ,  xu  <  x  <  x2i  ,  On  <  0  <  02i 
0  ,  otherwise  . 


The  indicator  function  Spei(x,0)  =  Sit2(x)Sii2(0)  ,  where 


Si,2(x)  = 


1 

0 

-1 


,  x  <  (xu  +  x2i)/2 
,  x  =  ( xu  +  x2i)/2 
,  x  >  (xu  +  x2i)/2 


Si, 2(0) 


1  ,  0  <  (On  +  02i)/2 

0  ,  0  =  (On  +  02i)/2 

—  1  ,  9  >  (On  +  92i)/2 
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delineates  the  sense  of  the  forces  generated  by  the  ith  pair.  The  symmetry  of 
the  function  arises  from  the  property  that  for  homogeneous  patches  having  uni¬ 
form  thickness,  equal  but  opposite  strains  are  generated  about  the  point  = 

((Xli  +  %2 i)/2,  {9\  i  +  02i)/2)- 

2.1.  Strong  Form  of  the  Modeling  Equation.  We  consider  here  the  modified  Donnell- 
Mushtari  equations 

Rphd^~R~d^~^f-Rqx~Rh pei (  } 

a2u  dNe  rNx9_  ^d(Ne)pe 

Rphw~~w~R~d^-Rqe~h pei (  } 


_  ,  d2w  _  dw 

RphW  +  m 


am,  i  am,  a-M,, 

dx 2  R  df)2  dxde  0 


=  Rqn  R 

i= 1  . 


d\Mx)vei 

.  1  d2(Mg)pei~\ 

dx 2 

R  862 

as  a  model  for  the  thin  shell  dynamics.  As  detailed  in  [3,  5,  9],  these  equations  are 
obtained  through  force  and  moment  balancing  with  only  low  order  terms  retained. 
Here  MX)Me,  Mgx  and  Mxg  are  internal  moments  while  Nx,Ng,  Ngx  and  Nxg  denote 
internal  force  resultants.  External  surface  forces  are  denoted  by  qx,qe,Qn  whereas 
the  external  resultants  (line  moments  and  forces)  generated  by  the  ith  patch  pair  are 
designated  by  ( Mx)pei ,  (Mg)pei,  {Nx)pei,(Ng)pei. 

Expressions  for  the  internal  force  and  moment  resultants  are  derived  under  the 
assumption  that  stress  is  proportional  to  a  linear  combination  of  strain  and  strain 
rate.  This  yields  a  model  which  incorporates  Kelvin- Voigt  or  strong  internal  damping. 
As  detailed  in  [3,  5],  the  resultants  Nx,  Nx9,  NgX}  Mx,  Mxg,  Mgx  derived  under  this 
assumption  are 


2Epe/lpe 


(2.2)  Nx  =  ^(e.  +  ^  +  Efr; =(*  + 

1  V  i=l  1  upe 


Vpe^'O')  Xpeifai 


+  73^2  (d*  +  "4)  +  (4  +  VpeSe)  Xpei(x,  0) 

1  V  i=  1  1  Upe 

Nxg  =  Ngx  =  2(1  +  v^Xf>  +  E  (l  +  upefXeXpe^X’  6) 

Eh 3  „  .  ^  2  Evea3  ,  .  ,  ... 

M x  =  — - (Kx  +  VKg)  +  N  JT  («s  +  ^Kg)  Xwfo  9) 

12(^1  —  V)  i=1  of1  Vpe) 


12(1  —  v2) 


(kx  +  vke)  + 


hi  3(!  “  upe) 


(kx  Vpeko)  Xpeifai  $) 
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„ ,  , ,  Eh 3  A  Evea3  ,  n. 

M -  ~  M>‘  =  WT^T  +  ^3(U^)TX’‘-(x’{ 9) 


24(1  +  v) 


where  the  constant  a3  =  (h/2  +  hpe)3  —  h3/8  results  from  integration  through  the 
thickness  of  the  patch.  Expressions  for  the  resultants  Ns  and  Mg  can  be  obtained  by 
replacing  ex,£g,Kx,  Kg  in  the  expressions  for  Nx  and  Mx  by  Eg,Ex,Kg,Kx,  respectively. 
The  midsurface  strains  and  changes  in  curvature  for  the  Donnell-Mushtari  model  are 


1  dv  w 
RdB  +  R 

1  d2w 


6x9  dx  +  R  dO 


d2w  1  d2w  2  d2w 

Kx  =  —dx2  ’  =  ~R2~d92  ’  T  =  -Rdxd8  ' 

Note  that  for  the  undamped  shell  which  is  devoid  of  patches,  the  resultant  equations 
(2.2)  reduce  to  the  classical  Donnell-Mushtari  expressions 


(1  - 1/2) 


(1  —  v2 


Nxg  =  Ngx  = 


du  u  f  dv 
dx  +  R\dd  +  U) 

1  dv  w  du 
RdB  +  R  +  Vdx 


Eh  I"  dv  1  du 

2(1  +  v)  [ lh  +  RdB 


12(1  -  v2 


12(1  -  v2)  [l?2  do2 


dx 2  R2  dd2\ 
1  d2w  d2w 

s,n  T  ^  m  n 


Mxg  =  Mgx  = 


\2R{\  +  v)  dxdB 


(e.g.,  see  [9]). 

To  characterize  the  external  or  active  patch  contributions,  it  is  typical  to  start 
with  the  assumption  that  the  strains  generated  by  a  patch  are  proportional  to  the 
applied  voltage  [3].  Since  differing  voltages  can  be  applied  to  the  outer  and  inner 
patches  in  the  pair,  we  will  differentiate  between  the  two  with  Vn (t)  and  Vi3(t)  used 
to  denote  the  voltages  to  the  outer  and  inner  patches  in  the  ith  pair,  respectively. 
The  proportionality  constant  relating  the  generated  strain  to  the  input  voltage  is 
designated  by  d3i.  As  detailed  in  [3],  the  total  external  moments  and  forces  generated 
by  the  patches  are 

\  —Epe  d3 1  f/a2  .  <23  \  Tr  ( «2  a3\  ' 

(  *)pei  ~1_‘  h  Xpeii  , e)  2  +  3R)  Vil  {  2  3 r)  Vi2 


mpe=T 


Vpe  2/ipg 


-XpeifaOWil-Va 


(N)  _  ~EP*  .^iy 

V^)pei  1  h  Xpt 

x  1  bpe 


i{x,6)Spei{x,6)  (hp 


+  ^)Va  +  ft, 


iiNd)pei  =  1  _  r  rf31  XpeAx,  S)Spei(x,  e)[Vn  -  Va] 

J-  t'np. 


252 


LQR  Control  of  Shell  Vibrations  via  Piezoceramic  Actuators 


where  o2  =  (h/2  +  hpe f  -  h2/ 4  and  a3  =  (h/2  +  hpef  -  h3/8.  When  substituted 
into  (2.1),  the  expressions  (2.4)  provide  the  input  from  the  patches  when  voltages  are 
applied. 

Finally,  the  fixed-edge  boundary  conditions 


(2.5) 


U  =  V  =  w  = 


dw 

dx 


x  =  0,  l 


are  used  to  model  the  end  behavior  of  the  shell.  These  boundary  conditions  are 
appropriate  for  experimental  setups  in  which  heavy  endcaps  prevent  edge  movement. 
Note  that  alternative  boundary  conditions  such  as  simply  supported  or  “almost  fixed” 
(see  [4])  can  be  employed  if  edge  movement  is  suspected. 


2.2.  Weak  Form  of  Modeling  Equations.  The  strong  form  (2.1)  of  the  modeling 
equations  requires  first  and  second  derivatives  of  the  moment  and  force  resultants. 
As  noted  in  (2.2)  and  (2.4),  both  the  internal  and  external  moment  and  force  resul¬ 
tants  are  discontinuous  due  to  the  piezoceramic  patches.  Hence  formal  analysis  and 
approximation  using  the  strong  form  of  the  modeling  equations  lead  to  difficulties 
due  to  differentiation  of  Dirac  distributions. 

To  alleviate  these  difficulties,  it  is  advantageous  to  consider  a  weak  form  of  the 
modeling  equations  which  can  be  derived  from  Hamilton’s  principle  (energy  consider¬ 
ations).  While  equivalent  to  the  strong  form  under  suitable  smoothness  assumptions, 
the  weak  form  provides  a  more  natural  setting  for  analysis  and  approximation. 

The  state  variables  for  the  problem  in  second-order  form  are  taken  to  be  y  = 
(u,v,w)  in  the  state  space  H  =  L2(r0)  xT2(r0)  xL2(r0).  For  the  fixed-edge  boundary 
conditions  (2.5),  the  space  of  test  functions  is  taken  to  be  V  =  H^(T0)  x  H^(T0)  x 
Ho2(r0)  where 

Hq(To)  =  {t?  €  Hl(r0)  1 77(0)  =  v(i)  =  0} 

Hq(T0)  =  {77  e  H2(r0)  1 77(0)  =  77X(0)  =  77(f)  =  r)x{i)  =  0}  . 

For  $  =  ( u,v,w )  and  \I>  =  (77!,  772, 773),  the  H  and  V  inner  products  are  taken  to  be 

($,¥)#  =  /  phuTj1d"/+  /  phvrj2d'y+  /  phu>Tj3d'y 

Jvo  •'I'o  "^0 


and 

{(E,Epe)$,<S)v  =  l  {j 


Eh 


drji  1  .  ,  drii 

+2 R{1~v)eid,W 


1=1  vpe 


dr]  1  1  ,  ,  dr/i 

\SX  +  Vpe£o)-Q^  +  277 vpe)exS  qq 


d'y 


+ 


/Jr 


Eh 


drji  l  .  ,  dr\2 

(eg  +  vex)-QQ  +  2^(1  —  v}ex6~ffa 


1=1  L 


.  .dr]  2  1  ,  s  dr]  2 

(£0  +  Vpe£x)-QQ  +  277H  upe)exB 


dx 


]} 


d'y 
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+ 


m 


V‘ 

h 2 


1  .  ._  h?  .  .  d2ri3 

-{ee  +  vex)r]3  -  —(kx  +  vne)-^ 


,d2r)3  h 2  d2ri3 

-  12R2  9  +  VK*}  qq 2  12R  1  V^Tdxde 


2  -Epe  /  m 


^Jj^(eg  +  Vpe£x)V3  -  a3{Kx  +  VpeKo)^£ 


d2Vz 


dx2 


as,  ,  .d2r]3  as  d2% 

r2  (k 6  +  UpeKx)  d62  R  (  pe)T  dxde 


dj 


where  £x,£g,£xg,  kx,  Kg,r  are  defined  in  (2.3)  and  d*y  =  RdOdx.  The  dependence  of 
the  inner  product  on  the  Young’s  moduli  is  explicitly  included  in  the  definition  to 
provide  a  notation  for  defining  analogous  damping  expressions  later  in  this  work. 
The  weak  form  of  (2.1),  as  derived  in  [5]  from  energy  principles,  is  given  by 


Rph^rh+RN:  dn 


*-d,7+N^-Rwi-RY.m 


drji 
pei  dx 


^7  =  0 


L{r^  +  n‘1W  +  -  ***  -  twu  t }  ^  =  0 

(2.6) 

r  f  ,  d2w_  dw_  Ar_  d2rj 3  1  d2rj3 

L\mW1’3 +  «  ”3  +  m ~ RM‘w  -  rm'W 


\R(Mx)pei^  +  -(Me\ 


0“V3 

' pe '  89 2 


=  0 


for  all  T  =  (771,772,%)  G  V.  A  comparison  between  (2.6)  and  (2.1)  illustrates  that 
in  the  weak  form,  derivatives  are  transferred  from  the  discontinuous  resultants  onto 
suitably  smooth  test  functions.  This  alleviates  the  difficulties  associated  with  the 
discontinuities  and  reduces  smoothness  requirements  on  approximate  solutions. 


2.3.  Abstract  Formulation.  To  define  appropriate  sesquilinear  forms,  we  group  stiff¬ 
ness  components  separately  from  damping  components.  To  this  end,  we  define 
Ui  :  V  x  V  -*■  C ,  i  =  1, 2  by 

cn($,T)  =  <(£?,  Epe)$,t>)v 

ff2(*,*)  =  ({cD,cDpt)®,V)v  +  J  pwfjsd'y. 

Note  that  {(cd,  cDpJ$,  ^)v  differs  from  {(E,  Epe)$,  T)y  only  in  that  Young’s  moduli 
are  replaced  by  Kelvin- Voigt  damping  coefficients.  It  can  be  directly  verified  that  the 
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stiffness  form  o\ 

satisfies 

(HI) 

|<Ti(<f>,  T)|  <  Ci|$|y[T|V'  ,  for  some  Cj  6  K 

(Bounded) 

(H2) 

Re<Ti(<E>,  <F)  >  c2  $ \v  ,  for  some  c2  >  0 

(R-Elliptic) 

(H3) 

ffi($,tf)  =<71($,$) 

(Symmetric) 

for  all  <h,  T  £  V . 

Moreover,  the  damping  term  a2  satisfies 

(H4) 

|cr2($,  <Ef)|  <  c3|$|y|T|v  ,  for  some  c3  £  M 

(Bounded) 

(H5) 

Recr2(<f>,  <f>)  >  c4|$|y  ,  for  some  c4  >  0 

(V-Elliptic) . 

Remark  1.  The  symmetry  of  <Ti  is  dependent  upon  the  choice  of  shell  model  and  ul¬ 
timately  reflects  the  Maxwell-Betti  reciprocity  theorem.  While  the  Donnell-Mushtari 
model  yields  a  symmetric  sesquilinear  form  <ti  ,  other  models  such  as  the  Timoshenko 
shell  model  will  not  yield  a  symmetric  form. 

To  represent  control  contributions,  let  U  =  Rs  denote  the  Hilbert  space  of  control 
inputs  and  define  B  €  C(U,  V *)  by 

(Bu(t),  X/  |  (Nx)pei  +  ~ft(No)pei-QQ 

for  T  e  V.  Here  (•,  -)v.  v  denotes  the  usual  duality  product.  Finally,  with  the 
definition  g  =  (1  /ph)[qx,'qg,qn},  we  can  write  the  weak  form  (2.6)  in  the  abstract 
variational  form 

(2.8)  {y(t),V)v,  v  +  a2(y{t),  9)  +  <n(y(t),  V)  =  ( Bu{t )  +  g(t),  V)v.y  . 

To  pose  the  problem  in  a  first-order  form  amenable  for  control  applications,  we 
define  the  product  spaces  7d  =  V  x  H  and  V  =  V  x  V  with  the  norms 

l(0i.  ^2)  Iw  —  I0ilv  +  |02|  H 

1(01.  02) | V  =  101  Iv  +  102 1 v-  ' 

The  state  is  taken  to  be  z(t)  =  (y{t),y{t))  €  H.  Finally,  the  product  space  forcing 
terms  are  formulated  as 

(2-9)  g{t)  =  ,  Bu{t)  =  Bu^  ■ 

The  weak  form  (2.8)  can  then  be  rewritten  as 

(i(f),  A)v»  v  +  a(z(t),  A)  =  (Bu(t)  +  g(t),  A)v.>v  for  A  £  V 
z{  0)  =  Z0  =  (yo,yi) 


(2.10) 
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where  a  :  V  x  V  — >  C  is  given  by 


for  <f>  =  ((pi,  <p2),ip  =  (ipi,  1P2)  G  V.  As  proven  in  [5,  page  109],  a  is  V  continuous  and 
for  A  >  0,  a(-,  •)  +  A  (•,  -)H  is  V-elliptic.  Prom  the  continuity  of  a,  it  follows  that  one 
can  define  an  operator  A  G  C(V,  V*)  by  a( T,  A)  =  (AT,  A)v*,v  ■ 

To  obtain  a  strong  form  of  the  first-order  system  which  is  appropriate  for  control 
purposes,  consider  the  system  operator 


(2.11) 


domA  =  {((pi,  (P2)  G  Tl\(p2  G  V,  A\(p\  +  ^2^2  G  H} 


A  = 


0 

-A\ 


I 

—A2 


with  Ai,  A2  €  C(V,  V*)  defined  by 

(Ai<pl,<p2)V»y  =  &i(<pi,<p2)  >  *  =  1>2  . 

It  should  be  notated  that  A  is  the  negative  of  the  restriction  to  domA  of  A  G  £(V,  V*) 
so  that  <t( T,  A)  =  (-AT,  A)n  for  T  G  domA,  A  G  V. 

A  strong  form  of  the  abstract  system  model  is  then  given  by 


(2.12) 


z(t)  =  Az(t)  +  Bu(t )  +  g(t)  in  V*  =  V  x  V* 
z(0)  =  z0 . 


The  rigorous  equivalence  of  solutions  is  established  through  the  following  theorems. 


Theorem  1.  Under  Hypotheses  (H1)-(H5)  on  cq  and  <t2,  A  generates  an  analytic 
semigroup  T(t)  on  and  V*.  In  terms  of  this  semigroup,  the  representation 

(2.13)  z(t)  =  T(t)zo  +  J  T(t  -  s)[Bu(s)  +  g(s)]ds 

defines  a  mild  solution  to  (2.12)  for  zq  G  V*  and  Bu+g  G  T2((0,  T);  V*).  Furthermore, 
this  semigroup  is  (uniformly)  exponentially  stable  on  V,7i  and  V*. 

Theorem  2.  Let  zsg  denote  the  semigroup  solution  to  (2.12)  given  by  (2.13)  and  let 
vvar  denote  the  weak  solution  to  (2.8).  Under  hypotheses  (H1)-(H5),  it  follows  that 
zsg(z 0,  T)  =  zvar(z0,  J7)  for  zq&U  and  T  =  Bu  +  g  G  L2(( 0,  T);  V*). 

Following  the  convention  of  [14],  we  will  use  the  same  notation  for  the  semigroups 
defined  on  V,  H  and  V*  since  each  semigroup  is  an  extension  or  restriction  ofjthe 
others.  Note  that  domA  defined  in  (2.11)  is  actually  dom^A,  the  domain  of  A  as 
a  generator  of  T(t)  in  H.  As  detailed  in  Lemma  3.6.1  and  Theorem  3.6.1  of  [14] 
(see  also  Section  IV.6  of  [12]  and  Chapter  2,  Theorem  5.2  of  [11]),  the  property  that 
A  generates  an  analytic  semigroup  on  V,H  and  V*  results  from  the  continuity  and 
V-ellipticity  of  a.  The  exponential  stability  of  T(t)  on  H  for  second-order  systems 
with  strong  damping  is  demonstrated  in  [1]  while  the  exponential  stability  of  T(t) 
on  V  and  V*  in  this  setting  is  proven  in  Lemma  3.3  of  [2],  Finally,  Theorem  2  is  a 
reformulation  of  Theorem  4.14  of  [5]  and  details  can  be  found  therein. 
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3.  LQR  Control  Problem 

In  the  last  section,  the  PDE  system  modeling  the  dynamics  of  the  thin  shell  with 
surface-mounted  piezoceramic  actuators  was  written  in  the  abstract  first-order  form 

z(t)  —  Az(t)  +  Bu(t)  +  g{t) 

z{  0)  =  z0 

in  V*.  In  this  section,  LQR  control  results  for  both  the  original  infinite  dimensional 
problem  and  approximating  finite  dimensional  problems  will  be  discussed.  Two  cases 
will  be  considered,  namely  when  g  =  0  and  g  is  periodic  in  time.  In  both  cases,  it  is 
assumed  that  state  observations  in  an  observation  space  Y  have  the  form 

(3.1)  zob(t)  =  Cz(t ) 

where  C  G  C(H,Y)  is  bounded.  The  assumption  that  C  is  bounded  is  made  to 
simplify  the  exposition  and  the  reader  is  referred  to  [2]  for  arguments  pertaining  to 
the  case  of  unbounded  observation  operators. 

3.1.  No  Exogenous  Input.  For  the  case  in  which  g  =  0,  the  infinite  horizon  prob¬ 
lem  concerns  the  determination  of  a  control  u  which  minimizes  the  quadratic  cost 
functional 

(3.2)  J(u,zo)  =  {\Cz{t)\2Y  +  [R}/2u{t)\l}dt 

subject  to 

z{t)  =  Az(t)  +  Bu(t) 
z(0)  =  z0 . 

The  positive,  self-adjoint  operator  71  is  used  to  weight  various  components  of  the 
control. 

As  detailed  in  [2,  5],  if  (A,B)  is  stabilizable  and  (A,C)  is  detectable,  then  the 
optimal  control  minimizing  (3.2)  is  given  by 

u(t)  =  -'RTxB*f\.z{t) 

where  II  solves  the  algebraic  Riccati  equation 

(^4*n  +  UA-  IlBTl-'B'n  +  C*C)z  =  0  for  all  z  G  V 

and  z(t)  =  S(t)z0.  Here  S(t)  is  the  closed  loop  semigroup  generated  by  A-B7lrlB*U. 

For  implementation  purposes,  it  is  necessary  to  define  an  approximate  system  and 
controls,  and  determine  convergence  criteria  for  these  approximate  controls  when  fed 
back  into  the  infinite  dimensional  system.  The  approximations  are  considered  in  a 
Galerkin  framework  with  trajectories  evolving  in  the  finite  dimensional  subspaces 
C  V  C  H.  It  is  assumed  that  the  approximation  method  satisfies  the  standard 
convergence  conditions 

(H1N)  For  any  z  G  V,  there  exists  a  sequence  zN  G  such  that  | z  -  zN |y  — >  0  as 
N  — »  oo. 
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The  finite  dimensional  operators  and  approximating  system  are  then  determined  as 
follows.  The  operator  AN  :VN-^VN  which  approximates  A  is  defined  by  restricting 
a  to  x  V^;  this  yields 

(3.3)  (~AN T,  A)w  =  <t(T,  A)  for  all  T,AeV". 

For  each  N,  the  C0  semigroup  on  VA  which  is  generated  by  AN  is  denoted  by  TN(t). 
The  control  operator  is  approximated  by  BN  £  £(U,  VN)  given  by 

(3.4)  ( Bnu ,  A)  =  {u,  B*A)n  for  all  u  £  U ,  A  £ 

while  CN  denotes  the  restriction  of  the  observation  operator  C  to  VN .  Finally,  we  let 
PN  denote  the  usual  orthogonal  projection  of  7i  onto  VN  which  by  definition  satisfies 

(i)  PN  T  eV"  for  T  e  H 

(ii)  (PN  T  -  T,  A)  =  0  for  all  AeV". 

This  projection  can  be  extended  to  PN  £  £{V*,VN)  by  replacing  the  H- inner  product 
(T,  A)n  by  the  duality  product  (Y,  A)v,  v  and  considering  T  £  V*. 

The  approximate  problem  corresponding  to  (2.10)  with  g  =  0  can  then  be  formu¬ 
lated  as 

jt  (zN(t),  A)w  +  a(zN(t),  A)  =  (BNu(t),  A)h  for  all  A  £  VN 

2n(0)  =  PNz0 . 

This  has  the  solution 

zN(t)  =  TN{t)PNz0  +  TN(t  -  s)PNBNu(s)ds . 

The  following  theorems  taken  from  [2,  5]  can  be  used  to  establish  the  convergence 
of  the  approximate  gains  to  their  infinite  dimensional  counterparts  for  certain  classes 
of  shell  models  (see  specifically  Theorem  7.10  and  Lemma  7.13  of  [5]). 

Theorem  3.  Assume  that  the  injection  i  :  V  ^  H  is  compact.  Moreover,  suppose 
that  the  damping  sesquilinear  form  can  be  decomposed  as  <72  =  Sc i  +  02,  for  some 
5  >  0,  where  the  continuous  sesquilinear  form  <r2  satisfies  for  some  A  £  R 

Re  ct2 (<£,</>)  >  ~^l0lv  -  M&Ih  for  all  0  £  7  . 

Finally,  suppose  that  the  operator  where  A2  £  C(V,V*)  is  defined  by 

(A20,??)v<  y  =  0-2(0, 77),  is  compact  on  V. 

If  for  some  ui  £  R  and  M  >  1,  T{t)  satisfies 

\T(t)\w)  <  Me"1  ,  t>  0, 

then  for  any  e  >  0  there  exists  an  integer  Ne  such  that  for  N  >  Ne, 

\TN(t)PN\c{n)  <  Me{w+e)t  ,  t>  0 
for  some  constant  M  >  0  independent  of  N. 
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Theorem  4.  Assume  that  the  injection  i  :  V  H  is  compact.  Let  the  sesquilinear 
form  a  associated  with  the  first-order  system  (2.10)  be  continuous  and  V-elliptic. 
Assume  that  the  operators  A,  B,C  of  (2.11),  (2.9),  (3.1),  respectively,  satisfy:  (A,B) 
is  stabilizable  and  (A,C)  is  detectable  where  B  E  C(U,  V*)  is  unbounded  and  C  E 
C(H,Y)  is  bounded.  Consider  an  approximation  method  which  satisfies  (H1N).  Fi¬ 
nally,  suppose  that  for  fixed  N0  and  N  >  N0>  the  pair  (AN,BN)  is  uniformly  stabi¬ 
lizable  and  (AN,CN)  is  uniformly  detectable. 

Then  for  N  sufficiently  large,  there  exists  a  unique  nonnegative  self-adjoint  solution 
IF  E  £(V*,  V)  to  the  Nth  approximate  algebraic  Riccati  equation 

an'  iF  +  iFa^  -  nNBNn~1BN,uN  +  cN‘cN  =  o 

in  VN.  There  also  exist  constants  M3  >  1  and  w3  >  0  independent  of  N  such  that 

SN{t )  =  e(.AN-BNn~1BN‘nN)t  satisfies 

\SN(t)\vN<M3e-“3t  ,  t>  0. 

Moreover,  the  convergence  of  the  Riccati  and  control  operators 
UNPNz  A  Hz  in  V  for  every  z  E  V* 

\bn'hnpn  -B*n\  -»o, 

as  ./V— >oo,  is  obtained. 

Example  1.  We  consider  in  this  example  a  shell  with  constant  parameters  p,  E,  is,  C£>. 
Such  a  case  would  arise  if  modeling  a  homogeneous  shell  or  a  shell  in  which  the  vari¬ 
ance  of  material  properties  across  regions  with  actuators  is  negligible.  The  sesquilin¬ 
ear  forms  for  this  model  are  specified  in  (2.7).  Due  to  the  constant  coefficients,  <j2 
can  be  written  as  —  6a\  +  a2  where  8  =  ^  and  ^)  =  pfr0  It  follows 

immediately  that 

Rea2{<t>,(t>)  =fi  f  4>2dl  >  ~\<t>\2v 

JTa  2 

for  all  4>  EV .  The  boundedness  of  the  operator  A2  generated  by  b2  follows  directly 
from  the  boundedness  of  a2.  Furthermore,  it  is  noted  that  Ajj1  E  C(V*,  V)  can  be 
written  as  an  operator  on  V  — »  V  by  Ajj1  =  where  the  injections  i  :  V  ^ 

H,i*  :  H  V*  are  compact.  Thus  Ajj1  is  compact  on  V  which  implies  that  Af1  A2 
is  compact  on  V  since  it  is  formed  from  the  product  of  compact  and  bounded  linear 
operators.  Finally,  the  exponential  stability  of  T(t),  the  stabilizability  of  (A,B) 
and  the  detectability  of  (A,B)  are  guaranteed  by  Theorem  1.  The  hypotheses  of 
Theorem  3  are  then  satisfied  for  this  system  and  one  obtains  uniform  bounds  on  the 
approximating  semigroups.  The  convergence  of  the  Riccati  and  control  operators  is 
then  obtained  from  Theorem  4. 

3.2.  Periodic  Exogenous  Input.  A  reasonable  assumption  in  many  mechanical  sys¬ 
tems  is  that  g  is  periodic  in  time  with  period  r.  The  system  to  be  controlled  in  this 
case  is 


(3.5) 


z(t)  =  Az(t )  +  Bu(t)  +  g(t) 
z{  0)  =  *(t) 
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and  an  appropriate  quadratic  functional  to  be  minimized  is 

Mu)  =  ll0T{\Cz(t)\l  +  \n^u(t)\2u}dt. 

Note  that  the  periodic  exogenous  term  g  can  be  used  to  model  inputs  such  as  noise 
generated  by  rotating  engine  components  (e.g.,  propellers  or  turbines)  or  periodic 
electromagnetic  disturbances. 

To  guarantee  the  existence  of  a  unique  Riccati  solution  and  control  for  the  system 

(3.5),  it  is  assumed  that  (A,B)  is  stabilizable  and  (A,C)  is  detectable.  Furthermore, 
it  is  assumed  that  g  G  L2(0,r;7f)  and  that  B  is  bounded.  Under  these  conditions,  it 
is  verified  in  [6]  that  the  Riccati  equation 

A*  n  +  UA  +  BB'RTlB*Y[  +  C*C  =  0 

has  a  unique  solution.  Furthermore,  if  r  denotes  the  r-periodic  solution  of  the  adjoint 
or  tracking  equation 


r(t)  =  -[A  -  BTZ~1B*U]*r(t)  +  Ug(t) 
r(0)  =  r(r) 

and  z  is  the  closed  loop  solution  of 

z(t)  =  [A-  Bn-lB*H)z{t)  -  BTTxB*r(t)  +  g(t) 

*(0)  =  *(t)  , 

then  the  optimal  control  is  given  by 

(3.6)  u(t)  =  —/R~lB*\U.z{t)  —  r(t)] . 

The  LQR  theory  for  this  case  is  less  complete  than  that  for  systems  with  no 
exogenous  input  and  is  currently  limited  to  bounded  control  inputs  B.  The  synthesis 
of  the  theory  for  unbounded  input  operators  and  periodic  exogenous  forces  is  currently 
under  investigation.  The  effectiveness  of  the  method  is  illustrated  in  the  final  example 
of  this  work. 


4.  Approximation  Method 

A  Galerkin  method  was  used  to  approximate  the  solutions  u,  v,  w  to  the  system 

(2.6),  or  equivalently,  (2.10).  The  approximating  subspaces  were  taken  of  the  form 

=  span{RuJ  x  span{R„J  x  span{S,„t } 

where  BUk ,  BVk ,  BWk  denote  bases  for  the  u,v  and  w  displacements,  respectively.  To 
exploit  the  tensor  nature  of  the  shell  domain  T0  and  periodicity  in  6,  the  bases  were 
constructed  with  Fourier  components  in  0  and  cubic  splines  in  x  (see  [7]  for  details). 
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The  approximate  displacements  were  then  given  by  the  expansions 

Mu 

uN(t ,  0,x)  =  J2  Uk(t)BUk  ( 9 ,  x) 

k= 1 

vN(t,e,x)  =  Y^vk(t)BVk(6,x) 

k= 1 
Mw 

wN(t,  9,x)  =  Yl  wk(t)BWk  ( 9 ,  x) . 

k= 1 

To  obtain  a  finite  dimensional  system  with  matrices  corresponding  to  the  finite 
dimensional  operators  in  (3.3)  and  (3.4),  the  sesquilinear  forms  <j\  and  02  were  re¬ 
stricted  to  VN.  This  yields  the  matrix  system 


0  1  r  1  r  0 

0 

1 

c-+- 

11 

1 

-k?d 

0  1  r  **(o)  1  r  y? ' 

0 

Mu  \  [  )W(0)  J  [  y?  . 

iW(t) 


0 

Bm 


[«(*)]  + 


where  ^(t)  =  [iq  (<),...,  UMu,vx(t),  •  •  • ,  «fi (t), . . . ,  uwJT  contains  the  TV  =  TVU+ 
Ny+Nw  generalized  Fourier  coefficients.  The  s  patch  inputs  are  contained  in  u(t)  = 
[m(t), us(f)]T.  The  reader  is  referred  to  [7]  for  details  concerning  the  construction 
of  the  mass,  stiffness  and  damping  matrices  ,  K%,  K/fn,  the  inputs  ,  g^ (t)  and 
the  initial  conditions  ,  ytf  ■ 

Multiplication  by  the  inverted  mass  matrix  yields  the  Cauchy  equation 


(4.1) 


zN(t)  =  ANzN(t)  +  BNu{t)+gN(t) 
zN(0)  =  ztf  , 


where  zN  e  M2A^  =  [&Ar(t),'dJ^(t)]T.  This  system  forms  the  constraint  equations  used 
in  the  finite  dimensional  LQR  theory  discussed  in  Section  3. 


5.  Numerical  Example 

We  consider  here  an  exogenous  force  g  which  is  periodic  in  time  with  period  r  = 
10007T  (500  Hz).  The  distribution  of  the  force  was  taken  to  be  binormal  in  the 
transverse  and  longitudinal  directions  and  centered  at  (x,9)  —  (i/2,0)  and  (x,0)  = 
(e/2, 7 r)  as  depicted  in  Figure  2.  The  magnitude  of  the  transverse  component  qn  was 
one  hundred  times  that  of  the  longitudinal  component  qx  so  as  to  model  an  input 
consisting  primarily  of  acoustic  sources  located  adjacent  to  (i/2,0)  and  (1/ 2, it). 

Six  pairs  of  piezoceramic  patches  of  length  1  cm  and  radial  measure  7t/3  were 
employed  as  actuators.  The  locations  and  material  properties  of  the  patches  along 
with  the  dimensions  and  physical  parameters  for  the  shell  are  summarized  in  Table  1. 

To  accommodate  the  periodic  exogenous  force  g ,  control  inputs  to  the  twelve 
patches  were  computed  using  the  feedback  law  (3.6).  Note  that  in  this  formula¬ 
tion,  independent  voltages  are  determined  for  the  individual  patches.  This  provides 
the  capability  of  generating  both  inplane  forces  and  bending  moments  in  the  regions 


R.C.H.  del  Rosario  and  R.C.  Smith 


261 


covered  by  the  patches  so  that  longitudinal,  circumferential  and  transverse  vibrations 
can  be  controlled. 


Figure  2.  Distribution  of  normal  forcing  function  at  6  —  0  and  6  =  n.  Observation 
lines  L\  =  { (m, 0) |0  <  x  <  £,8  =  7r/6},  i2  =  {{x,8)\x  =  3^/4, 0  <  9  <  2i r}  and 
observation  point  pi  =  (3^/4,  7t/32). 


Time  histories  of  the  uncontrolled  and  controlled  shell  displacements  at  the  point  p\  = 
(3^/4,  7t/32),  depicted  in  Figure  2,  are  plotted  in  Figure  3.  The  open  loop  trajectories 
exhibit  both  a  transient  response  settling  into  steady  state  and  a  beat  phenomenon 
due  to  the  close  proximity  of  the  driving  frequency  and  natural  frequencies  for  the 
shell.  At  this  observation  point,  all  three  displacement  components  are  reduced  by 
more  than  90%  when  controlling  voltages  are  fed  back  to  the  patches. 


Dimensions 

Parameters 

h  =  .00127  m 

p  =  2700  kg/m 3 

R=  Am 

£  =  7.1  x  10  wN/m2 

Shell 

1=1  m 

cd  =  2.816  x  10 ~5  Nms 
v=  .33 

f. i  =  58.97Ns/m2 

hpe  =  ,0001778  m 

pve  =  7600  kg/m3 

Epe  =  6.3  x  1010  N/m2 

Patches 

Centers  (x,6):  (.25,0),  (.5,0),  (.75,0) 

cd  =  3.211  x  10  ~5  Nms 

J-'pe 

(•25, 7r),  (.5, 7r),  (.75, 7r) 

CO 

II 

£ 

Dimensions:  x  :  0.1  cm,  6  :  7t/3 

dz i  =  190  x  10  12  m/V 

Table  1.  Dimensions  and  physical  parameters  for  the  shell  and  patches. 
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.o  I - 1 - 1 - ' - ' - J - 1 - 1 - 1 - 1 

0  0.01  0.02  0.03  0.04  0.05  0.06  0.07  0.08  0.09  0.1 

Time  (seconds) 


Figure  3.  Uncontrolled  and  controlled  shell  displacements  at  the  point 

Pi  =  (3^/4,  7t/32)  ; 

(a)  longitudinal  u, 

(b)  circumferential  v, 

(c)  transverse  w  displacements; 

— —  (uncontrolled), - (controlled). 


To  illustrate  the  spatial  attenuation  due  to  the  feedback  of  voltages  to  the  patches, 
root  mean  square  (rms)  plots  of  the  uncontrolled  and  controlled  trajectories  along 
the  axial  line  Li  and  circumferential  line  L2  (see  Figure  2)  are  plotted  in  Figure  4 
and  5,  respectively.  For  the  open  loop  case,  these  plots  illustrate  a  standing  wave  in 
all  three  components  of  the  displacement.  The  figures  also  demonstrate  significant 
reductions  in  all  three  displacement  levels,  even  in  regions  not  covered  by  patches. 
This  further  illustrates  the  effectiveness  through  which  the  model-based  control  law 
can  be  used  to  attenuate  shell  vibrations. 


Figure  4.  Root  mean  square  (rms)  displacements  along  the  axial  line  L\ ; 
- (uncontrolled), - (controlled). 


Figure  5.  Root  mean  square  (rms)  displacements  along  the  circumferential  line  L 
- (uncontrolled), - (controlled). 
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6.  Conclusions 

A  model-based  LQR  method  for  controlling  shell  vibrations  has  been  presented 
here.  While  developed  in  the  context  of  a  modified  Donnell-Mushtari  cylindrical  shell 
model,  the  method  is  quite  general  and  can  be  directly  extended  to  other  models  and 
geometries.  Under  the  assumption  of  strong  or  Kelvin- Voigt  damping  (a  reasonable 
and  typical  assumption  for  many  shell  materials),  model  well-posedness  and  conver¬ 
gence  of  control  gains  is  obtained  using  analytic  semigroup  theory  combined  with 
LQR  results  for  unbounded  input  operators. 

The  Galerkin  method  used  to  approximate  the  system  dynamics  utilizes  bases  con¬ 
structed  from  tensored  Fourier  polynomials  and  modified  cubic  splines.  As  discussed 
in  [7],  case  must  be  taken  when  developing  methods  for  approximating  shell  dynam¬ 
ics  so  as  to  avoid  shear  or  membrane  locking.  One  manifestation  of  locking  is  the 
existence  of  model  dynamics  which  are  incorrectly  approximated  by  the  numerical 
method.  The  use  of  a  numerical  method  which  exhibits  locking  can  lead  to  a  loss  of 
control  authority  and  potential  controller  destabilization  if  the  approximations  are 
sufficiently  inaccurate.  Further  details  regarding  issues  concerning  the  approximation 
of  shell  dynamics  and  convergence  properties  of  the  numerical  method  can  be  found 
in  [7], 

The  numerical  example  demonstrates  the  effectiveness  of  the  model-based  control 
method  for  attenuating  all  three  components  of  the  shell  displacement  in  the  pres¬ 
ence  of  both  transient  and  steady  state  dynamics.  Furthermore,  by  modeling  the 
global  shell  dynamics  and  patch  interactions  through  coupled  PDE  and  construct¬ 
ing  the  control  law  in  terms  of  these  PDE,  significant  reductions  in  displacement 
levels  throughout  the  shell  are  obtained,  even  in  regions  devoid  of  patches.  Numer¬ 
ical  implementation  of  the  LQR  method  in  this  manner  provides  a  first  step  toward 
the  development  of  model-based  state  estimators  and  compensators  which  can  be 
experimentally  implemented  in  shell  applications. 

Acknowledgements:  The  authors  thank  H.T.  Banks,  CRSC,  North  Carolina  State 
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The  Algebraic  Riccati  Equation  in 
Discrete  and  Continuous  Time 

OLOF  J.  STAFFANS 
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Abstract.  The  standard  state  space  solution  of  the  finite-dimensional  continuous  time 
quadratic  cost  minimization  problem  has  a  straightforward  extension  to  infinite-dimensional 
problems  with  bounded  or  moderately  unbounded  control  and  observation  operators.  How¬ 
ever,  if  these  operators  are  allowed  to  be  sufficiently  unbounded,  then  a  strange  change 
takes  place  in  one  of  the  coefficients  of  the  algebraic  Riccati  equation,  and  the  continuous 
time  Riccati  equation  begins  to  resemble  the  discrete  time  Riccati  equation.  To  explain 
why  this  phenomenon  must  occur  we  discuss  a  delay  equation  of  difference  type  that  can  be 
formulated  both  as  a  discrete  time  system  and  as  a  continuous  time  system,  and  show  that 
in  this  example  the  continuous  time  Riccati  equation  can  be  recovered  from  the  discrete 
time  Riccati  equation.  A  particular  feature  of  this  example  is  that  the  Riccati  operator 
does  not  map  the  domain  of  the  generator  into  the  domain  of  the  adjoint  generator,  as  it 
does  in  the  standard  case. 

1991  Mathematics  Subject  Classification.  Primary  49J27,  93A05,  93C55,  93C60 

Key  words  and  phrases.  Algebraic  Riccati  equation,  linear  quadratic  optimal  control,  well- 
posed  linear  systems,  spectral  factorization. 


1.  The  Discrete  and  Continuous  Time  Riccati  Equations 

We  begin  by  comparing  two  different  optimization  problems,  namely  the  discrete 
time  and  the  continuous  time  quadratic  cost  minimization  problems. 

In  the  discrete  time  quadratic  cost  minimization  problem  we  study  the  discrete 
time  system 

%n+l  —  Axn  -j-  Bun, 

'  yn  =  Cxn  +  Dun,  n  e  N  =  {0,1,2, 

Here  un  belongs  to  the  input  space  U,  xn  to  the  state  space  H,  and  yn  to  the  output 
space  Y.  These  are  Hilbert  spaces,  and  A,  B,  C  and  D  are  bounded  linear  operators 
between  the  appropriate  spaces.  The  problem  is  to  find  a  sequence  un  G  f2(N;t/) 
that  minimizes  the  cost  functional 

OO 

(1.2)  W(X0,U)  =  (( Vn,yn )  +  {un,Run))  , 

n— 0 

where  R  is  a  given  positive  definite  operator  on  U.  Under  mild  assumptions  (sta- 
bilizability,  detectability,  and  coercivity  of  the  cost  function),  the  optimal  control 
u°pt  js  0f  sfate  feedback  type,  i.e.,  there  is  a  bounded  linear  operator  K  such  that 
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u: =  Kxopt  for  all  n  G  N,  and  the  optimal  cost  W{x 0,uopi)  can  be  written  in  the 
form 

W( x0,  wopt)  =  {x0,  Px o),  x0  G  H, 

where  P  is  a  positive  definite  operator  on  H,  the  Riccati  operator.  Moreover,  the 
feedback  operator  K  and  the  Riccati  operator  P  satisfy  the  equations 

(1.3)  SK  =  —  (B*PA  +  D*C) , 

(1.4)  A*  PA  —  P  +  C*C  =  K*SK, 

(1.5)  S  =  R  +  D*D  +  B*PB. 

We  shall  refer  to  these  equations  as  the  discrete  time  Lure  equations.  We  call  S  the 
sensitivity  operator  of  the  discrete  time  problem,  due  to  the  fact  that  it  describes  the 
sensitivity  of  the  optimal  solution  with  respect  to  a  nonzero  closed  loop  control  signal; 
cf.  [Malinen(1997)].  In  the  standard  case  the  sensitivity  operator  S  is  invertible,  and 
by  eliminating  K  and  S  we  get  the  discrete  time  algebraic  Riccati  equation 

(1.6)  A*PA-P+C*C=(B*PA+D*C)*{R+D*D+B*PBy1(B*PA+D*C) . 

See,  for  example,  [Curtain  and  Zwart(1995),  pages  329-332]  or  [Malinen(1997)]. 

In  the  continuous  time  quadratic  cost  minimization  problem  we  study  the  contin¬ 
uous  time  system 

z'(t)  =  Az(t)  +  Bu(t), 

(1.7)  y(t)  =  Cz(t)  +  Du(t),  teR+  =  (0,  oo), 

z(0)  =  Zq. 

Here  u(t)  belongs  to  the  input  space  U,  z(t)  to  the  state  space  H,  and  y(t)  to 
the  output  space  Y,  still  Hilbert  spaces.  We  suppose  that  A  generates  a  strongly 
continuous  semigroup  A  on  H  and  that  D  is  bounded,  and,  for  the  moment,  we  also 
take  the  operators  B  and  C  to  be  bounded.  Naturally,  we  interpret  (1.7)  in  the  strong 
sense,  i.e.,  £  is  given  by  z(t)  =  A(t)x0  +  Jq  A(t  —  s)Bu(s)  ds  for  t  G  R+.  This  time 
the  problem  is  to  find  a  control  u€  L2(R+;U)  that  minimizes  the  cost  functional 

POO 

(1.8)  W{x0,u)  =  Jo  {{y{t),y(t))  +  (u{t),Ru(t)))  dt, 

with  a  positive  definite  R.  Again,  under  mild  assumptions  (stabilizability,  detectabil¬ 
ity,  and  coercivity  of  the  cost  function),  the  optimal  control  u°pt  is  of  state  feedback 
type,  i.e.,  there  is  a  bounded  linear  operator  1C  such  that  uopt(t )  =  JCzopt(t)  for  all 
t  G  R+,  and  the  optimal  cost  W{x 0,  varpt)  can  be  written  in  the  form 

W(xo,u°pt)  =  (xq,Vxo),  x0eH, 

where  V  is  the  positive  definite  continuous  time  Riccati  operator.  Moreover,  the 
feedback  operator  1C  and  the  Riccati  operator  V  satisfy  the  continuous  time  Lure 
equations 

(1.9)  SK  =  -  [B*V  +  D*C) , 

(1.10)  A*V  +  VA  +  C*C  =  JC*S1C, 

(1.11)  S  =  R  +  D*D. 
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Equations  (1.9)  and  (1.11)  hold  on  H  and  U,  respectively,  and  (1.10)  is  valid  on 
the  domain  dom(A)  of  A;  in  particular,  V  maps  dom(A)  into  dom(A*).  Again  we 
call  S  the  sensitivity  operator  of  the  continuous  time  problem,  since  it  describes  the 
sensitivity  of  the  cost  function  with  respect  to  a  nonzero  closed  loop  input.  In  the 
standard  case  the  sensitivity  operator  S  is  invertible,  and  we  can  eliminate  K  and  <S 
to  get  the  continuous  time  algebraic  Riccati  equation 

(1.12)  A*V  +  VA  +  C*C  =  {B*V  +  D*C)*  (R  +  D*D)~l  ( B*V  +  D*C) , 

valid  on  dom(A).  See,  for  example,  [Curtain  and  Zwart(1995),  pages  316-317]  or 
[Staffans(1997b)]. 

There  are  some  striking  similarities  and  differences  between  the  discrete  and  con¬ 
tinuous  time  Lure  equations  (1.3)— (1.5)  and  (1.9)— (1.11)  and  Riccati  equations  (1.6) 
and  (1.12).  Maybe  the  most  important  difference  is  that  the  discrete  time  sensitivity 
operator  S  depends  on  the  discrete  time  Riccati  operator  P,  but  that  the  continuous 
time  sensitivity  operator  <S  does  not  depend  on  the  continuous  time  Riccati  opera¬ 
tor  V,  and,  if  we  ignore  the  difficulties  caused  by  the  unbounded  operator  A,  the 
structure  of  the  discrete  time  Lure  equations  (1.3)— (1.5)  is  more  complicated  than 
the  structure  of  the  continuous  time  equations  (1.9)— (1.11). 

2.  The  Discrete  and  Continuous  Time 
Closed  Loop  Riccati  Equations 

Above  we  have  written  the  discrete  and  continuous  time  optimality  conditions  and 
Riccati  equations  in  “open  loop”  form,  i.e.,  they  are  written  in  terms  of  the  original 
operators  A,  B ,  C,  and  D  in  (1.1),  and  A,  B,  C,  and  D  in  (1.7).  It  is  also  possible 
to  give  “closed  loop”  versions  of  the  same  equations. 

If  we  in  (1.1)  replace  un  by  a  new  control  v„  according  to  the  formula 

un  =  Kxn  +  vn,  n6  N, 

then  (1.1)  becomes 

xn-i-i  —  A(*jXn  T  Bvn , 

Vn  =  Coxn  +  Dvn,  n  G  N, 

where 


Aa  =  A  +  BK,  C0  =  C  +  DK. 

Replacing  A  and  C  in  (1.3)— (1.5)  by  A  —  A$  —  BK  and  C  =  Cq  —  DK  we  get  the 
discrete  time  closed  loop  Lure  equations 

RK  =  -  {B*PA0  +  D*Cq)  , 

A*0PAo  -P  +  C*QC0  =  -K*RK, 

The  operator  R  need  not  be  invertible,  but  if  it  is,  then  we  can  eliminate  K  to  get 
the  discrete  time  closed  loop  algebraic  Riccati  equation 

A*0PA0  -P  +  C*0Co  =  -  ( B*PA0  +  D*Co)*  R~l  ( B*PA0  +  D*C0) . 
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The  difference  compared  to  (1.3)— (1.6)  consists  in  a  change  of  sign  in  the  quadratic 
term,  and  the  fact  that  the  operator  S  has  been  replaced  by  R  and  no  longer  enters  the 
equations.  Clearly,  the  invertibility  of  R  is  a  stronger  condition  than  the  invertibility 
of  S. 

In  the  continuous  time  case  we  can  proceed  in  the  same  way.  We  separate  the 
feedback  contribution  to  the  control  from  the  external  control,  and  write 

u(t)  =  Kx(t)  +  v(t),  teR, 

to  get  the  closed  loop  system 

z'(t)  =  Aoz(t)  +  Bv(t), 

y(t)  =  Cc)z(t )  +  Dv(t),  t  £  K+, 

2(0)  =  Zq, 

where 

A0  =  A  +  S/C,  CQ  =  C  +  DIC. 

Replacing  A  and  C  in  (1.9)— (1.11)  by  A  =  Ao  —  BK.  and  C  =  Co  —  DIC  we  get  the 
continuous  time  closed  loop  Lure  equations 

RIC  =  —  ( B*V  +  D*C0) , 

(21)  A*0V  +  VAo  +  CoC(j  =  —1C*  RIC.  , 

If  R  is  invertible,  then  we  can  eliminating  K.  to  get  the  continuous  time  closed  loop 
algebraic  Riccati  equation 

(2.2)  A*0V  +  VAo  +  C*0 Co  =  -  (B*V  +  D*Co)*  R~l  ( B*V  +  D*C0)  ■ 

Again,  the  invertibility  condition  on  R  is  a  stronger  one  than  the  invertibility  condi¬ 
tion  on  S  (whenever  D  is  nonzero).  Comparing  these  equations  to  the  corresponding 
open  loop  equations  we  see  the  same  changes  as  in  the  discrete  time  case. 

The  closed  loop  discrete  and  continuous  time  Lure  equations  and  Riccati  equations 
resemble  each  other  more  than  the  corresponding  open  loop  equations,  due  to  the  fact 
that  the  operators  S  and  S  have  disappeared.  However,  observe  that  the  closed  loop 
equations  contain  an  extra  implicit  dependence  on  the  feedback  operators  K  and  K, 
hidden  in  the  definitions  of  Aq,  Co,  Ao,  and  Co,  and  that  they  are  less  general  in  the 
sense  that  we  need  an  invertibility  condition  on  R  instead  of  invertibility  conditions 
on  S  and  S,  respectively. 

3.  Unbounded  Control  and  Observation  Operators 

Up  to  now  we  have  assumed  the  continuous  time  control  operator  B  and  ob¬ 
servation  operator  C  to  be  bounded.  They  can  be  allowed  to  be  somewhat  un¬ 
bounded  without  any  significant  nontechnical  additions  to  the  theory.  This  applies, 
in  particular,  to  the  class  of  smooth  Pritchard-Salamon  systems  studied  in,  e.g., 
[Pritchard  and  Salamon(  1985),  Pritchard  and  Salamon(1987)]  and  [van  Keulen(1993)]. 
However,  if  B  and  C  are  sufficiently  unbounded  then  the  structure  of  the  continuous 
time  Lure  equations  (1.9)— (1.11)  changes,  and  they  become  even  more  similar  to  the 
discrete  time  Lure  equations  (1.3)— (1.5). 
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The  main  problem  is  how  to  define  the  term  B*V  in  (1.9)  when  B  is  unbounded. 
The  largest  class  of  systems  that  we  are  able  to  cope  with  is  the  class  of  well- 
posed  and  regular  Salamon- Weiss  systems;  see  [Salamon(1987),  Salamon(1989)]  and 
[Weiss(  1994a),  Weiss(1994b)]  for  the  relevant  theory.  In  this  theory  the  natural  do¬ 
main  for  B*  is  dom(A*).  As  before,  we  want  (1.10)  to  hold  on  dom(A),  hence  (1.9) 
should  also  hold  at  least  on  dom(A).  Thus,  the  operator  B*V  should  be  defined  at 
least  on  dom(A).  Since  the  natural  domain  of  B*  is  dom(A*),  we  would  like  V  to 
map  dom(A)  into  dom(A*)  (as  it  does  in  the  case  of  bounded  B  and  C).  However, 
this  will  not  be  true  in  general,  and  in  particular,  it  is  not  true  in  the  example  that  we 
present  below.  Thus,  we  are  forced  to  extend  B*  to  a  larger  domain.  This  extension 
is  not  unique,  due  to  the  fact  that  dom(A*)  need  not  be  dense  in  the  larger  domain 
(this  will  be  the  case  in  the  example  given  below). 

The  necessary  extension  of  B*  can  be  carried  out  in  at  least  two  different  ways. 
Instead  of  extending  B*,  [Flandoli  et  al.(1988)]  show  that  in  the  case  where  D  =  0 
and  C  is  bounded  it  is  possible  to  find  some  extension  of  B*V  such  that  the  Riccati 
equation  (1.12)  holds  on  dom(A)  (this  result  applies  to  some  non-regular  systems  as 
well).  However,  the  definition  of  the  extended  B*T  given  by  [Flandoli  et  al. (1988)]  is 
quite  implicit  (it  is  part  of  the  proof  of  [Flandoli  et  al.(1988),  Corollary  4.9]),  and  it 
far  from  obvious  how  to  compute  this  extension  from  the  original  data.  Moreover,  it 
is  not  clear  to  what  extent  that  result  applies  when  D  ^  0  or  C  is  unbounded  (as  is 
the  case  in  the  example  that  we  present  below). 

Our  solution,  found  in  [Staffans(1997a),  Staffans(1997b)],  is  quite  different.  We 
impose  an  extra  “regular  spectral  factorization  assumption” ,  the  content  of  which 
is  that  both  the  input/output  map  of  the  original  system  and  a  particular  spectral 
factor  should  be  regular  together  with  their  adjoints  in  the  sense  of  [Weiss(1994a)]. 
See  [Staffans (1997a),  Staffans (1997b)]  for  details.  In  order  to  verify  this  assumption 
for  a  particular  system  one  needs  good  information  about  its  input/output  behavior. 
This  type  of  information  is  readily  available  for  delay  equations  but  not  for  general 
PDEs.  In  particular,  it  follows  from  [Staffans(1995),  Lemma  2.1]  that  this  assumption 
is  satisfied  in  the  example  presented  below,  but  it  is  still  an  open  question  whether 
or  not  it  is  satisfied  in  most  of  the  really  interesting  PDE  examples. 

The  regular  spectral  factorization  assumption  enables  us  to  replace  the  extension  of 
B*V  used  in  [Flandoli  et  al.(1988)]  by  B* V,  where  B*  stands  for  the  straightforward 
Weiss  extension  [Weiss  (1994a)] 

(3.1)  B*x=  lim  B*(3((3I  -  A*)~l  x 

of  B*.  As  shown  in  [Staffans(1997b)],  if  we  use  this  extension,  then  we  must  add  a  cor¬ 
rection  term  to  the  continuous  time  sensitivity  operator  S  and  replace  the  definition 
(1.11)  of  <S  by 

(3.2)  S  =  R  +  D*D  +  Jm<xB*V{aI-A)~1B 

(this  limit  exists  in  the  strong  sense  whenever  the  regular  spectral  factorization  as¬ 
sumption  holds).  Equations  (1.9)  and  (1.10)  remain  valid  (with  B*  replaced  by  B*). 
Observe  that  (3.2)  agrees  with  (1.11)  whenever  B  is  bounded.  As  in  the  discrete  time 
case,  it  can  be  shown  [Staffans(1997a)]  that  S  >  R+  D*D,  and  that  S  depends  only 
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on  the  weight  R  and  the  transfer  function  of  the  system,  i.e.,  S  is  independent  of 
the  particular  realization  A,  B,  and  C.  The  physical  interpretations  of  S  and  S  are 
identical  [Malinen(1997),  Staffans(1997b)]. 

Since  the  sensitivity  operator  S  does  not  show  up  in  the  closed  loop  Lure  and 
Riccati  equations,  it  is  to  be  expected  that  these  should  still  remain  the  same  as  in 
the  case  of  bounded  control  operator  B  and  observation  operator  C.  Indeed,  this  is 
the  case,  as  shown  in  [Staffans(1997a)]. 

The  purpose  of  this  paper  is  to  present  an  example  where  the  change  from  (1.11)  to 

(3.2)  takes  place.  This  example  is  a  delay  equation  of  difference  type.  It  can  be  formu¬ 
lated  both  as  a  discrete  time  system  and  as  a  continuous  time  system.  In  this  example 
the  continuous  time  sensitivity  operator  S  is  the  same  as  the  discrete  time  sensitiv¬ 
ity  operator  S,  and  the  two  Riccati  equations  (1.6)  and  (1.12)  (with  ( R  +  D*D) 
replaced  by  <S_1)  become  more  or  less  equivalent.  We  remark  that  this  example  has 
been  discovered  independently  by  [Weiss  and  Weiss(1997)].  Some  additional  details 
of  this  example  are  presented  in  [Staffans(1996a)j  and  [Weiss  and  Weiss(1997)]. 

Another  example  illuminating  the  difference  between  the  two  extensions  of  B* 
used  in  [Staffans( 1997b)]  and  [Flandoli  et  al.(1988)]  is  found  in  a  recent  preprint  by 
[Weiss  and  Zwart(1996)].  In  that  example  D  =  0  and  C  is  bounded. 

For  completeness,  let  us  point  out  the  fact  that  the  present  theory  says  nothing 
about  the  solvability  of  the  system  (1.9),  (1-10),  and  (3.2):  Is  the  solution  unique, 
and  can  these  equations  be  used  to  actually  compute  If,  K  and  <S?  In  other  words, 
the  converse  part  of  the  theory  is  still  missing. 

4.  The  Delay  Equation 

In  the  rest  of  this  note  we  consider  the  following  delay  equation  of  difference  type: 
x(t)  =  Ax(t  —  T)  +  Bu(t) ,  t  e  [0,oo), 

(4.1)  y(t)  =  Cx(t-T)+Du(t),  fe[0,oo), 

x(t)  =  given,  t=[-T,  0). 

Here  u(t)  G  U,  x(t)  G  H ,  y(t)  G  Y  (all  Hilbert  spaces),  and  A,  B,  C,  and  D  are 
bounded  linear  operators  between  the  appropriate  spaces.  For  simplicity,  we  assume, 
in  addition,  that  the  system  (4.1)  is  exponentially  stable,  but  this  assumption  can 
be  replaced  by  a  stabilizability  and  detectability  assumption.  Moreover,  we  assume 
that  the  data  have  been  chosen  in  such  a  way  that  the  discrete  time  Lure  equations 

(1.3) — (1.5)  have  a  unique  solution,  with  P  positive  definite  and  S  strictly  positive 
definite,  i.e.,  S  >  el  for  some  e  >  0.  The  cost  function  W  that  we  want  to  minimize 
is  given  by  (1.8). 
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5.  Two  Discrete  Time  Formulations 

It  is  easy  to  reformulate  (4.1)  as  a  discrete  time  system.  This  can  even  be  done  in 
two  conceptually  different  ways.  In  both  cases  we  start  with  the  observation  that  if 
we  define 

un(t)  =  u(t  +  nT ), 

xn(t)  =  x(t  +  (n  -  1)T), 

yn(t)  =  y{t  +  nT),  neN,  te[0,T), 

then  (4.1)  becomes 

^n+ l(f)  —  Axn(P)  T  R'U,n(t')i 

(5.1)  yn(t)  =  Cxn(t)  +  Dun(t),  neN,  t  e  [0,T), 

Xo(t)  =  given,  fe[0,T), 

and  the  cost  function  W  can  be  written  in  the  form 

fOO 

W(x0,  u)  =  /  (( yit ),  y(t))  +  (u{t),  Ru(t)))  dt 

JO 

°°  (  rT 

(5.2)  =  Y,  /  i(yn(t),yn(t))  +  {un(t),Run(t)))  dt 

n=0  \J0 

=  [  (f^({yn(t),yn{t))  +  {un(t),Run{t)))  \  dt. 

J0  \n= 0  / 

The  two  different  expressions  given  above  for  the  cost  function  gives  rise  to  two 
different  interpretations.  In  the  first  interpretation  we  take  the  input,  state,  and 
output  spaces  to  be 


U  =  L2(0,T;  U),  H  =  L2{0,T-H),  T  =  L2(0,T;F), 

and  we  have  a  standard  discrete  time  minimization  problem. 

In  the  second  interpretation  we  observe  that,  for  each  fixed  t  G  [0,  T),  the  sequences 
xn(t)  and  yn(t )  depend  only  on  x0{t)  and  un(t),  and  not  on  a:0(s)  and  un(s)  for  s^t. 
This  means  that  the  system  (5.1)  is  really  a  collection  of  independent  equations, 
parametrized  by  the  real  parameter  t  e  [0,T).  Moreover,  it  follows  from  the  last 
line  in  (5.2)  that  in  order  to  minimize  the  total  cost  it  suffices  to  minimize  each 
t-parametrized  problem  separately.  Thus,  in  this  interpretation,  we  have  an  infinite 
number  (parametrized  by  t  e  [0,  T))  of  problems  that  are  otherwise  identical,  but  have 
different  initial  states  x0  =  x0(t).  Each  subproblem  is  a  discrete  time  minimization 
problem  of  the  type  described  in  Section  1,  with  input  space  U,  state  space  H,  and 
output  space  Y.  Let  us  in  the  sequel  denote  the  common  optimal  feedback  and 
Riccati  operators  for  these  subproblems  by  K  and  P,  respectively.  Then,  for  all 
t  £  [0,T)  and  n  e  N,  we  have  <pt(t)  =  Kx^^t),  or  if  we  recall  the  definitions  of 
xn(t)  and  un(f), 


(5.3) 


uopt{t)  =  Kxov\t-T),  t>  0. 
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Moreover,  by  (5.2),  the  total  optimal  cost  will  be 

W(x0,uopt)=  [TW(x0{t),u°pt(t))dt 
Jo 

(5.4)  =  j  (x0(t),Px0(t))  dt 

r° 

=  /  (x(t),Px(t))  dt. 

J-T 

6.  A  Continuous  Time  Formulation 

Although  the  two  discrete  time  formulations  given  above  are  very  natural,  the  most 
common  approach  is  to  formulate  (4.1)  as  a  continuous  time  problem  rather  than  a 
discrete  time  problem.  Equation  (4.1)  is  a  special  case  of  what  is  usually  called  a 
“difference  equation”;  see  [Hale(1977),  Section  12.3].  The  standard  method  to  rewrite 
this  into  a  continuous  time  system  is  to  solve  (4.1)  to  get  x,  then  to  translate  x  to 
the  left,  and  to  restrict  x  to  [— T,  0)  to  get  a  new  initial  function  given  on  [— T,  0) 
for  the  same  equation.  In  this  setting  the  input,  state,  and  output  spaces  become  U, 
H  =  L2(—T ,  0;  H),  and  Y,  respectively,  and  the  state  z(t)  at  time  t  is  given  by 

z(t)  =  ($  x(t  +  s)) ,  s  €  [— T,  0). 

The  generator  A  of  the  semigroup  that  we  get  in  this  way  is  the  differentiation 
operator 

(6.1)  Az  =  z',  dom(A)  =  {z6  0;  H)  \  z{0)  =  Az(-T )  }. 

Its  adjoint  is  the  differentiation  operator 

(6.2)  A*z=-z',  dom{A*)  =  {z€W1’2(-T,0;H)\z(-T)  =  A*z{0)}. 

The  input  and  output  operators  B  and  C  are  unbounded,  and  they  are  defined  through 
the  equations 

(6.3)  B*z  =  B*z{ 0)  for  z  €  dom(A*),  Cz  =  Cz(—T )  for  z  €  dom(A). 

The  resulting  system  is  well-posed  and  regular.  For  details,  see  [Staffans( 1996b), 
Theorem  6.1]. 

From  the  discussion  in  the  previous  section  we  know  the  optimal  solution  to  the 
quadratic  cost  minimization  problem.  By  (5.3),  uopt(t )  =  Kxopt(t  —  T ),  hence  the 
continuous  time  state  feedback  operator  K.  is  unbounded,  and  it  is  given  by 

(6.4)  K-z  —  Kz(—T),  z  e  dom(A), 
and  by  (5.4),  the  continuous  time  Riccati  operator  is 

(6.5)  Vz  =  Pz,  z  £  L2{-T ,  0;  H). 

We  claim  that  V  does  not  in  general  map  dom(A)  into  dom(A*).  This  can  be 
seen  as  follows.  By  (6.1)  and  (6.2),  V  maps  dom(A)  into  dom(A*)  if  and  only  if 
P  =  A* PA.  However,  if  this  is  the  case,  then  we  can  iterate  the  equation  P  =  A* PA 
to  get  P  =  ( A*)kPAk  for  every  k  €  N,  and  letting  k  — >  oo  we  find  that  P  =  0.  Thus, 
the  only  case  in  which  V  maps  dom(A)  into  dom(A*)  is  when  P  =  0,  i.e.,  the  optimal 
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cost  is  zero  and  also  V  =  0.  In  all  other  cases,  in  order  to  give  a  meaning  to  the 
term  A*V  in  (1.12)  we  have  to  extend  the  domain  of  B* .  Since  V  maps  dom(A )  into 
W1,2(—T,  0;  H),  it  suffices  to  define  B*x  for  all  x  e  W1,2(—T,  0;  H )  (see  the  discussion 
in  Section  2).  Equation  (6.3)  does  not  define  B*  uniquely  on  W1’2(-T,0;  H)  since 
dom(A*)  is  not  dense  in  this  space.  The  Weiss  extension  of  B*  (cf.  (3.1))  is  given  by 

(6.6)  B*x  =  B*x{  0), 

for  all  x  in  W1,2(—T,  0;  H). 

7.  Computation  of  the  Operator  S 

Since  we  know  V,  we  can  compute  <S  from  (3.2).  For  each  /  £  H  =  L2(—T,  0;  H ) 
and  a  €  p(A)  we  have 

(( al  -  A)~l  f)  (t)  =  (/  -  e~aTA)  1 J  e“(t_s)/(s) ds  ~  J  T  e“(^s)/(s) ds- 

By  letting  /  tend  to  Bu  =  B8qu  (where  80  is  the  unit  atom  at  zero)  in  the  distribution 
sense  we  get 

(7.1)  ((a/  -  ^)_1  Bu)  (t)  =  eat  (/  -  e~aT A)'1  Bu, 

hence,  by  (3.2)  and  (6.6), 

Su  =  (R  +  D*D)u  +  Jim  WV  (a/  -  A)~l  Bu 

=  (R  +  D*D)u  +  Jim  B*P  (/  -  e~aT A)  _1  Bu 
=  (R  +  D*D)u  +  B*PBu  =  Su. 

Thus  S  =  S. 

8.  Verification  of  the  Modified  Continuous  Time  Lure  Equations 

Above  we  have  solved  the  quadratic  cost  minimization  problem  for  equation  (1.1) 
with  cost  function  (1.2)  by  appealing  to  the  discrete  time  theory.  Here  we  shall  show 
that  the  continuous  time  feedback  operator  /C  and  Riccati  operator  V  satisfy  (1.9) 
and  (1.10),  where  S  is  the  operator  that  we  computed  above,  i.e.,  S  —  S. 

Let  us  start  with  the  verification  of  (1.9).  Take  z  €  dom(A).  By  (6.4),  ICz  = 
Kz(—T),  and  by  (6.5)  and  (6.6), 

(B*V  +  D*C)z  =  B*Pz{  0)  +  D*Cz(—T). 

Replacing  z(0)  by  Az(— T)  we  get 

(B*P  +  D*C)z  =  ( B*PA  +  D*C)z(—T). 

Thus,  (1.9)  follows  from  (1.3),  (6.4),  and  the  fact  that  S  =  S. 
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It  remains  to  verify  (1.10).  Take  Zo,  Z\  £  dom(A).  Since  V  maps  TT1,2(— T,  0;  H) 

into  itself  we  can  integrate  by  parts  to  get 

(Azq,Vzi)  +  (zq,VAzi) 

=  J t  {z'o(t),  (Vzi){t))  dt  +  < zo(t ),  (Vz[)(t))  dt 

=  (zo(0),  (^i)(0))  -  (z0(-T),  (Vzi){—T)) 

+  f  (z0(t),(Vz[m-(VZiy(t))dt 

J-T 

=  ( Az0(-T ),  PAzi(—T))  -  z0(—T)Pzi(—T), 

where  the  last  equation  follows  from  the  facts  that  zo(0)  =  Az0(— T),  Zi (0)  = 

Az\{-T ),  and  ( Vz){t )  =  Pz{t).  Thus 

(Az0,Pz1)  +  ( z0,VAZl )  =  (z0(-T),  {A* PA  -  P)Zl(-T)) . 

This  equation,  together  with  (1.4),  (6.3),  and  (6.4),  gives  (1.10). 
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The  Wave  Equation  with  Neuman  Controls: 
On  Lions ’s  F  Space 
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Abstract.  A  crucial  issue  in  boundary  controllability  problems  for  partial  differential 
equations  is  choosing  the  appropriate  regularity  for  the  space  of  controls.  This  space  should 
satisfy  two  criteria:  it  should  be  large  enough,  so  that  controllability  is  possible;  and  it 
should  be  small  enough,  so  that  the  initial  regularity  of  solutions  is  preserved.  Reconciling 
these  two  opposite  criteria  is  sometimes  easy  and  sometimes  very  difficult.  The  aim  of  this 
article  is  to  provide  a  solution  for  this  problem  for  the  wave  equation  with  Neuman  controls. 

1991  Mathematics  Subject  Classification.  93C20 

Key  words  and  phrases.  Exact  zero  controllability,  boundary  control,  wave  equation. 


1.  The  controllability  problem 

We  denote  the  coordinates  in  R  x  1"  by  (cc0  =  t,x  1} ...,xn).  For  convenience  we 
call  t  the  time  coordinate  and  x  the  space  coordinate.  £  stands  in  the  sequel  for  the 
corresponding  Fourier  variable.  Set 

_l_d_ 

J  i  dxj 

Let  D  be  a  bounded  set  in  M”  with  C'2  boundary  dCl.  Then  we  use  the  notations  (•,  •), 
respectively  (•,  • )g  for  the  L2  inner  product  infix  [0, T],  respectively  dQ  x  [0, T]. 
Consider  a  second  order  hyperbolic  partial  differential  operator 

P(t,x,D)=djajkdk 

in  fi  x  [0,  T]  so  that  the  surfaces  t  =  const  are  space-like.  The  corresponding  inho¬ 
mogeneous  Neuman  problem  is 

Pu  =  0  in  fi  x  [0,T], 

/,  n  d„u  =  g  in  <90  x  [0,T], 

'  ’  '  u( 0)  =  ito  in  f2, 

0)  =  u\  in  f l. 

Here  d ^  is  the  conormal  derivative  with  respect  to  the  surfaces  t  =  const, 

=  aokdk 
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and  dv  is  the  conormal  derivative  with  respect  to  the  lateral  boundary  dQ  x  [0,  T], 

dv  =  vjajkOk 

where  v  is  the  unit  outer  normal  to  dQ. 

If  the  coefficients  are  smooth  then  the  homogeneous  problem  (i.e.  with  g  =  0)  is 
well-posed  in  HS(Q)  x  H^fQ)  at  least  if  -1/2  <  s  <  3/2.  For  other  values  of  s  one 
cannot  simply  use  the  Hs  spaces  and  the  appropriate  compatibility  conditions  are 
required  on  the  boundary.  If  the  coefficients  of  P  are  only  C1  then  the  homogeneous 
problem  is  well-posed  in  HS(Q)  x  HS~1(Q)  for  0  <  s  <  1. 

Let  F  be  an  open  subset  of  dQ  x  [0,T],  Then  the  exact  boundary  controllability 
problem  with  Neuman  boundary  controls  localized  in  T  can  be  stated  as  follows: 

Given  any  initial  data  (uq,U\)  G  Hs  x  Hs~x  find  a  boundary  control  g  G  Gs, 
supported  in  r,  so  that  the  solution  u  to  (1.1)  satisfies  u(T)  =  cfyu(T)  =  0. 

The  allowable  range  of  s  depends  on  the  regularity  of  the  coefficients  (see  the 
comments  above).  Since  the  Neuman  problem  is  backwards  well-posed,  one  can  also 
start  at  time  T  with  0  Cauchy  data,  solve  the  equation  backwards  and  try  to  find  g 
so  that  at  time  t  =  0  the  Cauchy  data  is  exactly  (u0,«i). 

The  problem  we  are  interested  in  here  is  that  of  the  choice  of  the  space  of  controls 
Gs.  A  good  choice  (see  [4])  should  have  the  following  properties: 

(C)  Gs  should  be  large  enough,  so  that  controllability  is  possible. 

(CR)  Gs  should  also  be  sufficiently  small,  so  that  the  solutions  to  (1.1)  stay  in  Hs 
between  the  times  0  and  T. 

The  usual  choice  in  the  literature  is  the  obvious  one,  G$  =  H 8-1 .  Under  appropriate 
assumptions  on  the  coefficients  and  on  the  size  of  the  set  T  (see  e.g.  [2],  [3],  [1],  [6]) 
this  space  is  large  enough  to  satisfy  (C).  However,  if  n  >  1  then  it  fails  to  satisfy 
(CR).  Hence,  one  can  infer  that  the  good  choice  of  Gs  is  a  space  slightly  smaller  than 
Hs~x.  The  first  idea  one  is  tempted  to  try  is  to  substitute  Hs~l  by  Hq~1,  q  >  s; 
unfortunately,  in  this  case  (C)  fails.  The  explanation  is  that  in  the  hyperbolic  region 
of  the  cotangent  bundle  of  the  boundary  (which  correspond  to  the  singularities  that 
hit  the  boundary  transversally  and  are  reflected)  the  microlocal  Hs~x  regularity  for 
functions  in  Gs  is  the  correct  one;  it  is  only  in  the  glancing  region  (which  corresponds 
to  singularities  propagating  in  directions  tangent  to  the  boundary)  that  one  needs 
some  better  microlocal  regularity  for  Gs  functions. 

We  conclude  the  discussion  of  the  controllability  problem  with  remarks  about  some 
of  its  features  which  are  only  indirectly  related  to  the  problem  considered  here. 

Remark  1.1.  (On  suitable  geometric  assumptions)  Obtaining  exact  boundary  control¬ 
lability  results  requires  certain  assumptions  on  the  geometry  of  the  controlled  region  T 
relative  to  Q  and  on  the  regularity  of  the  coefficients  of  the  hyperbolic  operator.  The 
two  most  important  such  sets  of  conditions  are 

i)  The  geometric  optics  condition  (see  [1]/,  based  on  the  idea  of  propagating  infor¬ 
mation  along  rays.  This  requires  any  generalized  bicharacteristic  of  P  inQx[0,T]  to 
hit  r  in  a  nondiffractive  point.  A  nondiffractive  point  is  a  point  where  the  ray  would 
leave  the  domain  if  there  were  no  boundary. 

ii)  The  pseudoconvexity  condition  (see  [7]/,  based  on  the  idea  of  propagating  in¬ 
formation  across  pseudoconvex  surfaces.  This  requires  the  existence  of  a  strongly 
pseudoconvex  function  (f>  in  Q  x  [0,  T]  which  is  negative  at  times  0,  T  but  positive 
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at  some  intermediate  time  t  and  which  satisfies  d, ,<f>  <  0  outside  T.  (the  result  was 
proved  in  [6]  for  the  case  when  the  control  is  taken  on  the  entire  boundary;  to  obtain 
the  more  precise  geometrical  condition  on  T  one  needs  to  use  more  refined  Carleman 
estimates  for  solutions  to  boundary  value  problems  as  in  [7] .) 

Remark  1.2.  (The  regularity  of  the  coefficients)  Well-posedness  of  the  hyperbolic  prob¬ 
lem  requires  essentially  C1  coefficients.  Many  of  the  features  of  the  problem  depend  on 
this  regularity.  The  geometric  optics  method  requires  C 2  coefficients.  The  Carleman 
estimates  method  requires  only  Cl  coefficients.  The  range  of  admissible  values  for  s 
also  depends  on  the  regularity  of  the  coefficients.  For  instance,  with  C 1  coefficients 
and  the  equation  in  divergence  form  the  admissible  range  is  0  <  s  <  1. 

2.  Duality  and  the  observability  problem 

Consider  now  the  dual  homogeneous  problem 

(P*v  =  0  in  ft  x  [0,T], 
d„v  =  0  in  3ft  x  [0,  T], 
u(0)  =  v0  in  ft, 

3^(0)  =  vx  in  ft, 

which  is  well-posed  in  the  space  Hq(Q)  x  F9_1(ft). 

To  it  we  associate  the  stable  observability  problem 

Given  the  observation  U|r  in  a  space  Fq  =  Fq,  determine  the  initial  data  (vo,vi)  £ 

Hq  x  Hq~\ 

Now  the  question  is  how  to  choose  the  observation  space  Fq.  It  should  be  small 
enough,  so  that  observability  can  hold;  but  it  should  also  be  large'  enough,  so  that  it 
contains  V|r  for  any  initial  data  (no^i)  G  Hq  x  F9-1.  This  can  be  summarized  in  the 
following  two  inequalities  (O),  which  guarantees  that  stable  observability  holds,  and 
(OR),  which  gives  the  regularity  of  the  observation. 

(0)  |v0|q  +  M,-i  <  c\v\F* 

(OR)  K|,  +  \vi\q-i  >  eM p* 

The  controllability  and  observability  problems  are  dual.  To  make  this  more  precise 
start  with  the  following  integration  by  parts, 

(2.2)  (u,  dpv)  -  (dpU,  v )  |Jz!p  =  (u,  Pv)  -  (v,  Pu)  +  ( u ,  dvv)&  -  (v,  dvu)d 

for  any  smooth  functions  u,v.  Suppose  now  that  u  solves  the  inhomogeneous  problem 
(1.1)  with  zero  Cauchy  data  at  time  T  and  that  v  solves  the  homogeneous  problem 
(2.1).  Then  (2.2)  becomes 

(2.3)  (wo^i)  -  (ui,v0)  =  ~(g,v)d 

which  is  the  duality  relation  which  connects  the  controllability  problem  with  the 
corresponding  observability  problem.  For  the  controllability  problem  one  needs  to 
look  at  the  map 

T  :  G.(T)  ->  Fs(ft)  x  Fs_1(fi),  Tg  =  (u(0),  3Mu(0)) 
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(where  u  is  assumed  to  solve  (1.1)  with  0  Cauchy  data  at  time  T).  Then  the  control¬ 
lability  and  regularity  statements  (C)  and  (CR)  are  equivalent  to  saying  that  T  is  a 
surjective,  respectively  a  bounded  operator. 

For  the  observability  problem,  on  the  other  hand,  consider  the  map 

S  :  Hq{Q)  x  Hq~\ 0)  -►  Fq(T),  T(v0,  m)  =  v{r 

(where  v  is  assumed  to  solve  (2.1)).  Then  (0)  and  (OR)  are  equivalent  to  saying 
that  S  is  bounded  from  below,  respectively  that  S  is  bounded. 

By  (2.3)  the  duality  S  =  T'  is  achieved  if  we  take  q  +  s  =  1  and  Gs  =  F'q. 
The  controllability  and  regularity  statements  (C),  (CR)  are  then  equivalent  to  the 
observability  and  regularity  estimates  (O),  (OR).  Hence,  determining  the  correct 
spaces  Gs  for  the  boundary  controls  reduces  to  finding  the  appropriate  spaces  Fq  for 
controllability. 

Lions’s  idea  is  the  following.  Suppose  we  know  the  uniqueness  result  v  —  0  in  F 
implies  v0  =  V\  =  0.  Then  define  the  norm  of  the  space  F9  exactly  by 

Mfidn  =  Mg  +  Mg-l 

Such  a  space  will  have  the  right  properties,  and  its  dual  is  the  good  space  for  con¬ 
trollability. 

Thus,  it  remains  to  characterize  the  space  Fq.  Intuitively,  the  dependence  of  the 
Fg  norms  on  q  should  be  fairly  simple.  One  would  expect  that  Fq  and  Fr  differ  by 
exactly  q-r  derivatives,  Fq  =  Dr~qFr.  Then  it  is  best  to  characterize  Fq  for  a  given 
value  of  q.  The  simplest  choice,  which  is  used  in  the  sequel,  is  q  =  1.  Then  we  want 
to  identify  the  space  F  =  Fi,  defined  by 

Mir  =  Mff>  +  Ml* 

3.  The  F  space 

Define  the  “tangential”  component  R  of  P  on  the  boundary  dQ  x  [0,  T)  by 
Ru  =  Pu  whenever  dvu  =  d2u  =  0 

This  is  equivalent  to  saying  that  local  coordinates  can  be  chosen  in  which  dQ  = 
{ xn  =  0}  and 

(3.1)  p{x,  0=il~  r(x,  £')  on  {xn  =  0} 

Since  the  boundary  dQ  x  [0,  T]  is  time-like,  it  follows  that  R  is  also  hyperbolic.  If  P 
has  C1  coefficients  then  R  also  has  C1  coefficients. 

Introduce  now  the  spaces  Xs,e  associated  to  R  by 

Xs’0  =  Hs  XsA  =  {ueHs;  RueH3-1}  XS~1  =  HS  +  R*HS+1 
Xs ’e  =  pfs’°,  X8’1^  0  <  0  <  1  Xs-0  =  [Xs'°,  Xs  0  <  9  <  1 
(complex  interpolation).  These  spaces  are  L2  type  Sobolev  spaces  which  have  a 
special  structure  near  the  characteristic  set  of  R.  It  is  easier  to  understand  these 
spaces  in  the  constant  coefficient  case.  Then  the  above  definitions  are  equivalent  to 

x8’e  =  {u\  a  +  iei ni  +  ^fueL2} 
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One  can  see  that  the  index  s  corresponds  to  classical  derivatives,  while  the  index 
9  corresponds  to  “derivatives”  away  from  the  characteristic  set  of  R.  Such  spaces 
have  been  investigated  in  detail  in  [8].  By  interpolation  one  can  easily  prove  that 
they  have  some  good  properties  such  as  microlocalization  and  the  expected  mapping 
properties  for  pseudodifferential  operators  and  R: 


(3.2) 

QPSm  •  Xs'® _ >  xs~m'® 

(3.3) 

^  1,0—1 

On  a  bounded  open  set  F  we  define  the  Xs'6  space  in  the  standard  way,  as  the 
restriction  to  F  of  Xs'6  functions. 

Remark  3.1.  If  the  coefficients  of  R  are  only  C 1  then  the  Xs ,d  spaces  are  well-defined 
and  have  the  above  properties  only  for  |s|  <  1,  |s  —  6\  <  1.  This  is  exactly  the  range 
which  can  be  obtained  by  interpolation  from  the  spaces  X1'1,  X0,1,  Hl  and  their  duals. 

Then  our  main  result  is 

Theorem  3.1.  Suppose  that  P  has  C1  coefficients  and  that  the  boundary  dQ  is  of  class 
C2.  Assume  that  a  suitable  set  of  geometric  assumptions  ( see  Remark  1.1)  is  fulfilled. 
Then  the  F  norm  is  equivalent  to  the  X1/2,1/2  norm. 

Note  that  X0,0  =  L2  and 


X1-1  =  {w  e  Hl  \  Rwe  L2} 

Hence,  if  we  interpret  R  as  a  selfadjoint  operator  in  L2  then  the  space  X1/2,1/2  can 
be  locally  characterized  as 

x-1/2,1/2  =  h1/2  n  D(RV2) 


This  result  leads  to  the  following  optimal  choices  for  the  observability  and  control¬ 
lability  problems: 

Corollary  3.1.  Suppose  that  P  has  C 1  coefficients  and  that  the  boundary  dfl  is  of 
class  C2.  Then  the  Fq  norm  is  equivalent  to  the  Xq~1l2'll2  norm  and  the  optimal 
space  of  controls  Gs  is  Xs-1/2’-1/2,  0  <  s,  q  <  1.  If  both  the  coefficients  of  P  and  the 
boundary  are  smooth  then  the  same  result  holds  for  all  real  s,  q. 

Before  we  prove  the  theorem,  let  us  make  one  important  observation.  The  F  norm, 
as  we  have  defined  it,  applies  only  to  traces  of  solutions  to  the  Neuman  problem,  which 
is  not  a  dense  set  in  any  Sobolev  space.  Hence,  there  is  definitely  more  than  one  norm 
say,  on  smooth  functions,  which  extends  F.  A  striking  example  of  that  is  given  by 
the  following  version  of  the  above  theorem: 

Theorem  3.2.  Suppose  that  P  has  C1  coefficients  and  that  the  boundary  dfl  is  of 
class  C2.  Assume  that  a  suitable  set  of  geometric  assumptions  is  fulfilled.  Then  the 
F  norm  is  equivalent  to  the  H 2/3  fl  X1/2,1/2  norm. 
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The  X1/2’1/2  and  H2/ 3  n  X1/2,1/2  norms  are  certainly  not  equivalent  in  general. 
However,  it  turns  out  that  they  are  equivalent  on  the  function  space  where  F  is 
defined. 

Proof.  We  need  to  prove  the  following  two  estimates 

(3.4)  K|i  +  Mo  <  c|u|xi/2,i/2(d 

(3.5)  Mi  +  Mo  —  cl?;lff2/3nx1/2.i/2(9fix[o,T]) 

The  second  one  is  a  trace  regularity  result  which  was  proved  in  [9].  The  assumptions 
in  [9]  are  that  the  coefficients  are  sufficiently  smooth.  It  now  appears  that  the  same 
result  holds  even  for  C 1  coefficients;  however,  this  will  be  proved  elsewhere. 

To  give  a  complete  proof  to  (3.4)  we  would  need  to  redo  the  appropriate  observabil¬ 
ity  estimates  for  various  sets  of  geometric  conditions  (see  e.g.  [1],[6]).  Fortunately, 
in  all  these  cases  the  boundary  traces  appear  in  the  same  way.  The  common  compu¬ 
tation  done  in  all  these  works  is  local,  near  the  observed  part  of  the  boundary  T.  It 
goes  like  this: 

One  starts  with  a  quadratic  form  of  the  form  (Pu,  Qu ),  where  Q  is  of  order  1  and 
is  either  a  differential  operator  or  an  operator  of  the  form 

Q  =  QoDn  +  Qi 

where  Qo,  Qi  are  tangential  pseudodifferential  operators  of  order  0,  respectively  1, 
with  purely  imaginary  symbol.  The  main  step  is  then  to  compute  by  integration  by 
parts  (commuting) 

2 Re{Pv,  Qv)  =  {Pv,  Qv)  +  (Qv,  Pv)  =  A(v,  v)  +  B{v,  v) 

where  A  is  an  second  order  interior  quadratic  form  and  B  is  a  second  order  boundary 
quadratic  form.  A  is  then  used  to  estimate  the  H1  interior  norm  of  v,  and  B  is 
bounded  by 

B(v,v)  <  c\v\2m[r) 

Our  aim,  therefore,  is  to  refine  this  to 

B(v,v)  <  c\v\2xi/2.i/2 

To  do  that,  we  need  to  determine  what  is  B.  It  is  easier  to  do  that  in  local  coordinates 
where  (3T)  holds.  Then 

P  =  D2n-R ,  Q  =  Q0Dn  +  Qi 
When  we  integrate  by  parts  to  commute  P  and  Q  we  obtain 

2 Re{Pv,  Qv)  =  interior  terms  +  (Dnv,  iQ\v)q  +  (Dnv,  iQ0Dnv)g  +  (iQov,  Rv)g 
Recalling  the  boundary  condition  Dnv  =  0  it  follows  that 

B(v,v)  =  {iQov,  Rv)g 
By  (3.2),  (3.3)  this  can  be  bounded  by 

| B(v,v)\  <  c\Rv\x-^n,-^/AQt>v\x1/'2’1/2  —  clvlx1/2-1/2 
This  concludes  the  proof.  Note  that  the  above  argument  requires  only  C 1  regularity 
of  the  coefficients  of  P.  □ 
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Abstract.  We  consider  an  initial  and  boundary  value  problem  the  one  dimensional  wave 
equation  with  damping  concentrated  at  an  interior  point.  Our  main  results  assert  that  the 
decay  rate  is  uniform  for  regular  initial  data  and  give  lower  estimates  of  the  decay  rate.  An 
essential  intermediate  step  is  the  description  of  the  spectrum  of  the  associate  dissipative 
operator. 

1991  Mathematics  Subject  Classification.  93D15,  93C20,  35B37 

Key  words  and  phrases.  Invariance  principle,  unique  continuation,  strong  stabilization,  de¬ 
cay  rates. 


1.  Introduction  and  statement  of  the  main  results 


The  main  goal  of  the  present  paper  is  to  study  the  asymptotic  behaviour  of  solutions 
for  the  following  initial  and  boundary  value  problem: 


(1.1) 


d^u 

u"(x,  t )  -  —  (:E,  t)  +  w'(a>  =  0,  Vx  e  (0,  1),  Vf  g  (0,  oo) 


(1.2) 


u(0,  t )  =  u(l,  t)  —  0,  Vf  €  (0,  oo) 


(1.3)  u(x,0)  =u°(x),  u'(x,0)  =u1(x),  Vx  G  (0,1), 

where  6a  is  the  Dirac  mass  concentrated  in  the  point  a  G  (0, 1)  and  by  u',  u"  we 
denoted  the  time  derivatives  of  u.  Equations  (1.1)— (1.3)  are  dissipative  since 

(1.4)  E’(t)  =  — |u'(a,t)|2, 


where  E  =  E(t)  is  the  energy 


m  = 


du ,  .  2 

L 

\u’{x,t)\2  + 

dx. 


The  main  known  results  concerning  the  asymptotic  behaviour  of  solutions  of  (1.1)- 
(1.3)  can  be  summarized  as  follows: 


Theorem  1.1.  1.  For  any  a  6  (0, 1)  the  problem  (l.l)-(  1.3 )  admits  a  unique  solution 
u  satisfying 


u  G  C([0,  oo),  ^(0, 1))  n  C1([0,  oo),  L2(0, 1)). 

Preceding  Page  Blank 
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2.  The  solution  u  of  (1.1 )-( 1.2,)  has  the  decay  property 
(1.5)  Hm(||u(t)||Hi(o,i)  +  IK(£)IU2(o,i))  =  0. 

if  and  only  if 


(1.6)  a  G  (0,1)D(R-Q). 

3.  For  any  a  satisfying  (1.6)  the  decay  of  u  to  zero  is  not  uniform  in  the  energy 
space.  More  precisely ,  for  any  function  if  :  [0,oo)  — >  E  with  limi_>0O^(f)  =  0  there 
exists  a  sequence  (tn),  tn  — »  oo  and  a  solution  u ^  of  (1.1)-(1.2>)  such  that 


(1.7) 


||{M»/>(4),^ftn)}|Ui(0,l)xZ,2(0,l)  ,,  ,  w  >  i 

H{t**(o),t4(o)}||fl8(0,1)x£8(0>1)  ~nnh  - 


The  results  in  Theorem  1.1  were  essentially  proved  in  [2]  and  [7].  However,  for  the 
sake  of  completeness  we  shall  sketch  the  proof  in  section  2. 

In  the  present  paper  we  shall  prove  that  if  we  assume  some  additional  smoothness  of 
the  initial  data  we  can  achieve  a  definite  rate  of  decay  for  the  solutions  of  (1.1)— (1.3). 
Moreover  we  shall  show  that  the  energy  decays  at  most  as  a  negative  power  of  time. 
In  order  to  state  the  precise  result  we  shall  consider  a  subspace  of  Hq(0,  1)  x  L2(0, 1) 
defined  by 

(1.8)  V(A)  = 

3 'll, 

=  {(u,  v)  G  [JTj(0,  l)]2|u  G  H\0,a)  n  H2{a,  1),  ^(o+)  -  ^(a-)  =  u(a)}, 
endowed  with  the  norm 

(1-9)  ll(w,v)|||(^)  =  |MIh2(0,o)  +  IMItf2(a,l)  +  IMItfW)' 

Our  first  result  on  the  uniform  decay  of  solutions  of  (1.1)— (1.3)  is 

Proposition  1.1.  For  any  a  satisfying  (1.6)  there  exists  a  function  ha  :  [0,  oo)  — »  E 
with  lim  ha(t)  =  0,  such  that  the  solution  u  of  (l.l)-(l.Z)  satisfies 

t — KX) 

(1-10)  ||{'«(f),  u'(t))  ||  #1(0,1)  xL2  (0,1)  <  /laWH(w°,W1)||I>(^), 

V(u°,u1)  G  V(A),  Vt>  0. 


Our  main  result  shows  that  for  any  irrational  a  the  function  ha  in  Proposition  1.1 
tends  to  zero  at  most  as  ^=.  More  precisely  we  have 


Theorem  1.2.  For  any  a  satisfying  (1.6)  there  exists  a  sequence  tn 
sequence  (un)  of  solution  of  (1.1)-(1.2)  such  that 


lim  VtC 


\\{un(tn),  Un(tn)}  |[#i(0,l)xL2(0,l) 

||{un(0),<(0)}||C(^) 


C  G  (0,  oo). 


oo  and  a 


(1.11) 
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Moreover  for  any  6  >  0  there  exists  a  constant  a  satisfying  (1.6)  such  that 


(1.12) 


S  H{Mnftn);<ftn)}lUi(0,l)xL2(0,l) 

||K(o)x(o)}IM 


for  some  un  and  tn  as  above. 

The  plan  of  this  paper  is  as  follows:  in  the  second  section  we  prove  some  preliminary 
results  including  Theorem  1.1;  in  the  third  section  we  prove  the  main  results;  we  end 
up  with  a  section  devoted  to  further  comments  and  other  related  questions. 


2.  Wellposedness  and  strong  stabilization  results 

We  shall  first  study  the  wellposedness  of  (1.1)-(1.3)  by  using  the  theory  of  semi¬ 
groups.  With  V{A)  defined  by  (1.8)  and  X  =  H^( 0,1)  x  L2(0,1)  we  consider  the 
operator  A  :  'D(A)  — >  X  defined  by 


(2.1) 


The  wellposedness  theorem  for  (1.1)— (1-3)  will  be  a  simple  consequence  of  the  follow¬ 
ing  result: 


Lemma  2.1.  The  space  V{A)  is  dense  in  X  and  -A  is  the  generator  of  a  continuous 
semigroup  of  contractions  in  X. 


Proof.  The  proof  of  the  density  of  V(A)  in  X  is  a  simple  exercise  so  we  shall  skip  it. 
If  we  suppose  that  X  is  endowed  with  the  scalar  product 


du-t  du 2 

— —  — — 1-  viv2 
dx  dx 


dx, 


a  simple  calculation  shows  that 

(•4(“)'(“)),  =  2',2<“)-°'v(“)eW>' 
so  A  is  monotone.  In  order  to  prove  that  -A  generates  a  semigroup  of  linear  con¬ 
tractions  on  X  it  suffices  to  show  that  A  is  onto,  i.e.  A{T>{A))  =  X.  Let  (^j  G  X 
and  consider  the  equation 


(2.2) 


which  can  be  written  as 

(2.3)  t»  =  -/fffj(0,l), 

(2.4)  -~+v(a)5a  =  heL2(  0,1). 

From  (2.3),  (2.4)  and  a  simple  elliptic  regularity  result  it  follows  that  equation  (2.2) 
admits  a  unique  solution  G  X?(^4) ,  so  A  is  onto.  □ 
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In  order  to  study  the  asymptotic  behaviour  of  the  solutions  of  (1.1)— (1.3)  we  shall 
need  the  following  compactness  result. 

Lemma  2.2.  The  space  T>(A)  is  compactly  embedded  in  X  and  the  operator  A~l  is 
compact  from  X  into  X. 

Proof.  We  first  notice  that  AT1  is  a  linear  continuous  isomorphism  from  X  onto 
'D(A)  and  from  L2(ffl)  x  H~1( Q)  onto  X.  It  suffices  then  to  use  the  fact  that  X  is 
compactly  embedded  in  L2(Q)  x  H~l{Pl).  □ 

Proof  of  Theorem  1.1.  In  order  to  prove  the  first  assertion  it  suffices  to  notice  that 
(1.1)— (1.3)  can  be  written 


and  to  apply  Lemma  2.1. 

In  order  to  prove  (1.5)  consider  the  problem 

(2.5)  w"  -  =  0,  in  (0, 1)  x  (0,  oo) 

(2.6)  w(0,  t)  =  w(l,t)  =  0,  Vt  €  (0,  oo), 

(2.7)  w\a,  t )  =  0,  Vf  £  (0,  oo). 

A  simple  Fourier  expansion  of  w  combined  with  the  independence  of  complex  expo¬ 
nentials  show  that  the  only  function  w  satisfying  (2.5)-(2.7)  is  w  =  0.  By  applying 
a  version  of  the  invariance  principle  of  LaSalle  (cf .  [3]  and  [4])  we  obtain  now  that  u 
satisfies  (1.5). 

We  still  have  to  prove  the  third  assertion  of  Theorem  (1.1).  We  first  notice  that 
if  a  is  rational  then  A  admits  purely  imaginary  eigenvalues.  If  a  is  irrational  then 
it  can  be  approached  by  a  sequence  of  rationals  and  one  can  show  (see  Lemma  3.1 
below  for  details)  that  there  exists  a  sequence  ptn  of  eigenvalues  of  A  such  that 

(2.8)  Tlenn  — >  0,  I'm  \xn  — *■  oo. 

Denote  now  by  S(t)  the  semigroup  generated  by  A  and  by  C(X,  X)  the  space  of 
linear  operators  from  X  into  X.  Relation  (2.8)  implies  the  estimate 

|| 5(f)  ||  <  Vt  >  0, 

is  false  for  any  to,  M  >  0.  It  suffices  now  to  apply  a  result  from  [9]  to  obtain  assertion 
3  of  Theorem  1.1. 

3.  Decay  estimates 

In  this  section  we  shall  prove  Proposition  1.1  and  Theorem  1.2. 

Proof  of  Proposition  1.1.  It  suffices  to  show  that 
(3.1)  hm  ||S,(t)||£(x>(J4),x)  =  0, 

where  S(t)  is  the  semigroup  generated  by  A  and  C(T>(A),  X)  is  the  space  of  linear 
bounded  operators  from  T>(A)  into  X.  In  this  case  estimate  (1.10)  holds  true  with 
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ha(t)  =  ||S,(t)||jC(x>(^)ix)-  Let  us  suppose  that  (3.1)  is  false,  i.e.  that  there  exists  e  >  0, 
tn  -»  oo  and  (Un)  C  T>(A),  ||C4||i>(^)  =  1  such  that 

(3.2)  ||S(t„)£4||x  >  c,  Vn  >  1. 

As  by  Lemma  2.2  V(A)  is  compactly  embedded  in  X  there  exists  a  subsequence  (t/nJ 
of  ([/„)  and  U  £  X,  U  ^  0  such  that 

(3.3)  lim  -  U\\x  =  0- 

k— >oo 

As  S'(i)  is  a  semigroup  of  contractions  relation  above  and  (3.3)  imply  that 

(3.4)  lim  \\S(tnk)Unk  -  S{tnk)U\\x  =  0. 

k— >oo 

From  (3.2)  and  (3.4)  we  obtain  that 

\\S(tnk)U\\x  > 

for  k  large  enough,  which  obviously  contradicts  (1.5).  □ 

An  essential  intermediate  step  in  the  proof  of  Theorem  1.2  is  the  study  of  the 
eigenvalues  and  of  the  eigenvectors  of  the  operator  A  :  T>(A)  —■ ►  Hq(0,  1)  x  L2( 0, 1), 
where  V(A)  is  defined  by  (1.8)  and  A  is  defined  by  (2.1).  The  eigenvectors  and 
eigenvalues  of  A  are  characterized  by  the  following  result: 

Lemma  3.1.  If  a  £  (0, 1),  a  &  Q  then  a  complex  number  A  is  an  eigenvalue  of  A  if 
and  only  if  A  satisfies  the  equation 

(3.5)  3  -  e2A  -  e2aA  -  e2(1_o)A  =  0. 


All  the  eigenvalues  of  A  are  simple  and  the  eigenvector  \^pj  corresponding  to  the 
eigenvalue  —X  is  given  by 


(3.6)  <p\{x) 


0  <  x  <  a, 


eK  —  ’ 

gA(iE-l)  _  g— A(x— 1) 

— pr,  a  <  X  <  1. 


eA(£— 1)  _  e-A(«-l) 

Proof.  A  simple  calculation  shows  that  relation 


A 


<P\ 

lp  A  I  ’ 


holds  true  if  and  only  if 
(3.7) 


and 

(3.8) 


d24>\ 
dx 2 


'fpx(x)  =  -A  <j>x(x). 


ipx{x)  =  -X<t>x(x), 


(x)  +  X2(j>x{x)  =  0,  x  £  (0,  a)  U  (a,  1), 


(3.9) 


<Ma+)  =  ^a(o-)  =  (f>x(a), 
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(3.10) 


(3.11)  </>a(0)  =  <f>x(l)  =  0. 

If  we  suppose  that  (j>\ (a)  =  0  and  we  use  that  a  $  Q  relations  (3.8)-(3.11)  imply  that 
<j>x(x)  =  0,  Vx  G  (0, 1).  This  is  why  we  shall  admit  that  =  1.  From  (3.8),  (3.9), 
(3.11)  we  can  then  obtain  that  <£a,^a  satisfy  (3.6).  Condition  (3.10)  (with  =  1) 
implies  now  that  A  satisfies  (3.5).  □ 

We  can  give  the  main  proof  of  this  section. 

Proof  of  Theorem  1.2.  Let  us  make  the  notation 


G(a,  A)  =  3  -  e2A  -  e2aA  -  e2Cl”a)A, 

with  a  satisfying  (1.6)  We  first  notice  that  the  first  assertion  of  Theorem  1.2  follows 
from  the  existence  of  a  sequence  ( zn )  C  C  such  that  zn  is  an  eigenvalue  of  A  and 


0  <  7 Zezn  <  T — — -,  \lmzn\  — *•  oo.  In  this  case  the  sequence  of  solutions  of  (1.1)- 
j2^77lZnj 

(1.2)  with  initial  data  {^(aO,  0}  satisfies  (1.11)  with  tn  =  \Imzn\2 .  By  Lemma  3.1 
a  sequence  ( zn )  satisfies  conditions  above  if  and  only  if 


(3.12) 


Zn  —  %n  fyni  Un 


oo,  0  <  xn  <  G(a,zn)  =  0. 
Vn 


In  order  to  prove  the  existence  of  a  sequence  ( zn )  satisfying  (3.12)  we  first  notice 
that  by  Theorem  5  in  [1]  there  exist  two  sequences  ( pn ),  ( qn )  C  Z  such  that 


(3.13) 


Pn 
a  —  ■ — 
Qn 


5-2 

<  —5-,  Vn  >  1,  q„  -*  00. 


By  Rouche’s  theorem  (cf.  [10,  p.  243])  it  suffices  to  prove  the  existence  of  a  constant 
K  >  0  such  that 


(3.14) 


dG 

G(a,  A)  -  -^-(a,  qnm){\  -  qnm) 


< 


< 


dG 

d\ 


(a,  qnTri)(X  -  qniri) 


•ru  -I  K 

,  if  \\-qnm\  =  — . 

Qn 


We  shall  first  estimate  the  left  hand  side  of  (3.14)  by  writing  the  Taylor  expansion 


(3.15) 


dG 

G(a,  A)  -  -  q„ni)  = 

1  dmG 

=  G(a,  qnTti)  +  J2  ^7r2)(A  -  ^7r*)” 


m>  2 


Concerning  the  first  term  in  the  right  hand  side  of  (3.15)  a  simple  calculation  gives 

m  -\  n  2(a-£a)gniri  „2(;rL_a)g„7rj  _ 

G(a,qnm)  =  2  —  e  ^  «»'  —  e  /  _ 

Pn 


=  4  sin 


n  |  qn'tt 

qn 
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By  using  (3.13)  relation  above  implies  the  existence  of  ni  >  0  such  that 

47 

(3.16)  \G(a,q„m)\  <  — y,  Vn  >  m. 

0l?n 

We  shall  now  estimate  the  infinite  sum  in  the  right  hand  side  of  (3.15).  We  can  easily 


We  shall  now  estimate  the  infinite  sum  in  the  right  hand  side  of  (3.15).  1 
check  that 

(3.17)  |^(a, A)  =  ~2m[e2A  +  ame2aX  +  (1  -  o)me2(1-“)A],  Vm  >  1 

which  implies  that 


'-(a,  qnm)  <  3  •  2m,  Vm  >  1. 


K 

If  I A  —  (7n7ri|  =  —  inequality  above  implies  the  existence  of  n2  >  0  such  that 

Ql 


(3.18) 


1 

E  ^!5^r(a>rf(A-9n7ri) 


,  m!  <9Am  ’ 
,,  2K 


ql  '  “  4^ 


By  combining  (3.15),  (3.16)  and  (3.18)  we  obtain 
(3.19)  G(a,  A)  -  ~(a,  qn m)(X  -  g„7rz)  <  ^ 

Vn  >  max(ni,n2),  |A  -  qn7ri|  = 

Vn 

On  the  other  hand  from  (3.17)  it  follows  that 

^-(a,  qnm)  =  -2  fl  +  ae2^"-)9"”  +  (1  -  a)e2(- 
u\ 


which  implies  that 


j^-(a,qn m)  >  2  1 1  +  2 cos  (a-^Jqn tt  |. 

Prom  relation  above  and  (3.13)  we  obtain  the  existence  of  n3  >  0  such  that 


—  (a,  qnm){\  -  qnm)  >  -y 
oa  qn 

,  ,  K 

Vn  >  n3,  |A  -  q„m\  =  -y. 


'in 

Finally  by  combining  (3.19)  and  (3.20)  we  obtain  that  (3.14)  holds  true  for  all  K  > 
provided  that  n  is  large  enough,  so  we  obtain  the  first  assertion  of  Theorem  1.2. 
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Let  us  now  suppose  that  6  >  0.  According  to  [14]  we  can  find  a  E  (0, 1)  which 
is  transcendental  and  such  that  there  exist  two  sequences  (p„),  (qn)  C  Z  with  (qn) 
strictly  increasing  such  that 

<  ^r,  Vn  >  1. 

€ 

By  using  (3.21)  and  the  estimates  already  proved  in  the  first  part  of  this  proof  one 
can  easily  show  that  the  second  assertion  of  1.2  also  holds  true.  □ 


(3.21) 


a  — 


Pn 


4.  Comments  and  related  questions 

The  results  in  this  paper  can  be  generalized  to  the  case  of  n-space  dimensions,  for 
the  following  problem: 

(4.1)  u"  —  A u  +  g(u')6y  =  0,  in  Cl  x  (0,  oo) 

(4.2)  u  =  Q,  on  T  x  (0,  oo) 


(4.3) 


u(x,  0)  =  u°(x),  u'(x,0)  =  v}(x),  in  Cl, 


where: 


(a)  Cl  is  an  open  bounded  subset  of  R”  with  regular  boundary  T. 

(b)  7  =  duj,  where  w  C  u>  C  fl  is  an  open  set  with  regular  boundary. 

(c)  <57  is  the  Dirac  mass  concentrated  on  7  and  the  function  g  :  K  — >  R  is  supposed 
to  be  continuous  and  strictly  monotone,  with  g(0)  =  0. 

The  two  dimensional  version  of  (4.1)-(4.3)  was  treated  in  [8]  where  it  was  proved 
that  the  solutions  decay  to  zero  for  almost  all  w  C  Cl.  In  the  same  paper  it  was 
proved  that,  in  the  one  dimensional  case,  there  exist  a  E  (0, 1)  such  that  the  solution 


of  (1.1)— (1.3)  goes  to  zero  like  y  j. 

A  question  related  to  the  problem  studied  in  this  paper  is  the  stabilization  of  elastic 
plates  by  the  use  of  piezoelectric  actuators  (see  [5]  for  appropriate  models  and  [11], 
[12],  [13]  for  the  associated  control  problems).  In  this  case  new  difficulties  arise  as 
the  control  function  is  scalar  valued,  so  one  may  hope  that  strong  stabilization  holds 
only  in  the  case  of  simple  eigenvalues. 

In  general,  the  methods  used  in  this  paper  apply  for  a  large  class  of  equations  of 
the  form  u"  +  Au  +  g(u')S7  =  0  including  the  plate  equation  with  various  boundary 


conditions. 
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Exact  Controllability  of  the  Generalized 
Boussinesq  Equation 

BING- YU  ZHANG 

Department  of  Mathematical  Sciences 
University  of  Cincinnati 

Abstract.  In  this  paper  we  consider  distributed  control  of  the  system  described  by  the 
generalized  Boussinesq  equation 

Utt  —  tixx  (u(u)  lLj-.x')xx  "b  f 

on  the  periodic  domain  S,  the  unit  circle  in  the  plane. 

In  the  case  of  local  control,  if  the  control  /  is  allowed  to  act  on  the  whole  domain  S,  it 
is  shown  that  the  system  is  globally  exactly  controllable.  In  the  case  of  local  control  where 
the  control  /  is  only  allowed  to  act  on  a  sub-domain  of  S,  we  show  that  the  same  result 
holds  if  the  initial  and  terminal  states  have  “small  amplitude”  in  a.  certain  sense. 

1991  Mathematics  Subject  Classification.  Primary  35Q20,  93C20 

Key  words  and  phrases.  Boussinesq  equation,  exact  controllability,  contraction  principle. 


1.  Introduction 

In  the  present  work  we  consider  distributed  control  of  a  class  of  equations  which 
may  be  described  as  being  of  generalized  Boussinesq  type.  They  have  the  general 
form 

(1-1)  Utt  uXx  =  (u(n)  ux x)xx  -b  f 

in  which  u  =  u(x,t),  x,t  £  R  and  the  subscripts  denote  the  corresponding  partial 
derivatives,  a  :  R  — ►  R  is  a  smooth  function  with  o(0)  =  0.  The  equation  (1.1)  is 
a  perturbation  of  the  linear  wave  equation  which  takes  into  account  effects  of  weak 
nonlinearity  and  dispersion,  and  appears  in  the  theory  of  nonlinear  strings. 

The  classical  Boussinesq  equation  is  of  the  form 

3 

(1.2)  lift  uxx  +  -(u  )xx  T  buxxxx  0, 

and  was  derived  by  Boussinesq  [2]  in  1872  as  a  model  for  the  propagation  of  small 
amplitude,  long  waves  on  the  surface  of  water.  It  possesses  special,  traveling-wave 
solutions  called  solitary  waves.  Historically,  Boussinesq’s  theory  [2]  was  the  first 
to  give  a  satisfactory,  scientific  explanation  of  the  phenomenon  of  solitary  waves 
described  by  Scott-Russell  thirty  years  earlier.  Depending  on  whether  the  coefficient 
b  in  the  equation  (1.2)  is  positive  or  negative,  the  equation  (1.1)  is  called  the  “good” 
Boussinesq  equation  or  the  “bad”  Boussinesq  equation.  The  “bad”  version  is  used 
to  describe  a  two-dimensional  flow  of  a  body  of  water  over  a  flat  bottom  with  air 
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above  the  water,  assuming  that  the  water  waves  have  small  amplitudes  and  the 
water  is  shallow.  It  also  appeared  in  a  posterior  study  of  the  Fermi-Pasta-Ulam  (FPU) 
problem,  which  was  performed  to  show  that  the  finiteness  of  the  thermal  conductivity 
of  an  anharmonic  lattice  was  related  to  nonlinear  forces  in  the  springs.  However,  the 
“bad”  Boussinesq  equation  is  notorious  for  its  initial  value  problem  (IVP)  being  not 
well-posed  even  locally  (in  time).  Of  the  “bad”  Boussinesq  equation  only  solutions  of 
soliton  type,  which  can  be  found  using  the  inverse  scattering  method,  are  known.  For 
this  reason,  we  only  consider  a  generalized  version  of  the  “good”  Boussinesq  equation 
(1.1). 

Our  main  concern  is  the  study  of  equation  (1.1)  from  a  control  point  of  view.  In 
particular,  we  consider  the  equation  posed  on  a  periodic  domain  S,  the  unit  circle  in 
the  complex  plane 


(1.3)  Utt  tixx  —  (u(t/)  ttxx)xx  T  f  i  x  £  S,  t  C  R 

with  the  forcing  function  /  =  f(x ,  t )  as  a  control  input.  The  goal  is  to  influence  the 
system  by  choosing  an  appropriate  input  /. 

The  control  theory  of  Boussinesq-type  equations  was  initiated  by  Liu  and  Russell 
[6],  [7]  and  [8].  Both  distributed  control  and  boundary  control  of  the  Boussinesq 
equation  have  been  considered.  Some  dissipative  mechanism  is  introduced  into  the 
systems  through  appropriate  feedback  control  laws.  They  showed  that  the  small 
amplitude  solutions  of  the  resulting  closed  loop  system  are  then  exponentially  stable. 

In  this  paper  we  consider  the  exact  control  problem:  choose  an  appropriate  control 
input  f(x,t)  to  guide  the  system  described  by  (1.3),  during  time  interval  [0,T],  from 
a  given  initial  state  to  another  preassigned  terminal  state  in  an  appropriate  function 
space  of  system  states. 

Note  that  for  an  appropriately  smooth  solution  u(x,t )  of  the  unforced  equation 
(f(x,  t)  =  0)  it  is  easy  to  see  that  any  smooth  solution  u  satisfies 

4  [  ut(x,t)dx  =  0 
dtJs 


for  any  t  G  R.  Therefore 


J  ut(x,t)  =  J^ut(x,0)d 


x 


J  u(x,t)  =  J  u(x,  0)dx  + 1 J  ut(x,0)dz 


for  any  t  €  R.  Usually  one  chooses  the  initial  value  ut(x,  0)  with  fsut(x,  0 )dx  =  0  (cf. 
[1])  so  that  both  Jsut(x,t)dx  and  Jsu(x,t)dx  are  conserved  for  the  unforced  system. 
In  order  to  keep  these  quantities  conserved  while  conducting  control  we  require  that 


the  control  input  /  in  system  (1.3)  satisfies 


(1.4) 


J  f(x,  t)dx  =  0,  Vt  €  R. 


A  more  interesting  case  is  obtained  if  some  further  a  priori  restrictions  are  imposed 
on  the  applied  control  f(x,  t ).  Let  us  suppose  that  g(x)  be  a  smooth  function  defined 
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on  S  with  its  support  contained  in  S  satisfying 

[g]  ■=  J  g(x)  dx  =  1 

where  [5]  denotes  the  mean  value  of  the  function  g  over  the  circle  S.  We  restrict  our 
attention  to  a  control  of  the  form 

(1.5)  f(x,  t)  =  Gh  :=  g{x)  (h(x,  t)  ~  Jg  9{v)Kv >  Wv)  ■ 

Thus  h(x,t)  may  be  considered  as  a  new  control  input.  It  is  easy  to  check  that 

/  f(x,t)  =  0 
J  s 

with  /  given  by  (1.5);  therefore  the  restriction  (1.4)  is  satisfied.  Depending  on  the 
support  of  the  function  g  in  the  domain  S,  there  are  two  different  control  situations. 
If  the  support  of  g  is  the  whole  domain  S  then  the  control  acts  on  the  whole  domain 
and  we  refer  to  it  as  global  control.  If  the  support  of  the  function  g  is  a  proper 
subset  of  S,  the  control  acts  only  on  a  sub-domain  and  we  refer  to  it  as  local  control. 
Obviously  we  have  more  control  power  in  the  global  control  situation  than  in  the 
local  control  case.  On  the  other  hand,  the  local  control  situation  includes  more  cases 
of  practical  interests  and  is  therefore  more  relevant  in  general. 

Now  we  describe  the  main  results  of  this  paper.  Let  HS(S)  ( s  >  0)  be  the  space 
of  all  functions  of  the  form 

OO 

v(x)  =  J2vkeikx 

— OO 

such  that 

(  00  '1 1/2 

(1.6)  {EM2(1  +  |*|)*j  <+00. 


The  left  hand  side  of  (1.6)  is  a  Hilbert  norm  for  H*(S );  we  denote  it  by  ||u||s. 

For  the  control  problem  just  introduced  the  exact  control  problem  consists  in  using 
the  indicated  control  function  to  guide  the  system,  during  [0,  T],  between  a  given  pair 
of  initial  states  u(x,  0)  =  (j>o(x )  and  ut(x,  0)  =  ipo(x)  and  a  given  pair  of  terminal  states 
u(x,T)  =  4>t{x)  and  ut(x,T)  =  iPt(x),  in  an  appropriate  function  space  of  system 
states,  necessarily,  in  view  of  the  conserving  control  actions  under  consideration,  such 
that 


(1.7)  J  ipo{x)dx  =  J  tpT{x)dx  =  0,  J  4>o(x)dx  =  J  cj>T{x)dx. 


In  the  global  control  case,  the  control  h  acts  on  the  whole  domain  S  and  we  have 
the  following  strong  controllability  result. 


Theorem  1.1.  Let  T  >  0  and  s  >  0  be  given  and  assume  that  the  function  g  in  ( 1.5,) 
satisfies 


(1.8) 


\g(x)\  >  p  >  0, 
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Then  for  any  ipo),  {4>t,  V>t)  e  Hs+2(S)  x  HS(S)  satisfying  (1.7)  there  exists  a 
control  function  h  E  L2( 0,  T;  HS(S)  such  that  the  equation 

U'tt  uxx  =  (u(u)  uxxjxx  T  Gh 

has  a  solution  u  E  C([0,T];Hs+2(S))  n  C^QO, T];  HS(S ))  satisfying 
u(x,  0)  =  4>o(x),  ut(x,  0)  =  tp0(x), 

u(x,  T)  =  f)T(x),  ut(x,  T )  =  ipT{x)- 

In  other  words,  we  have  “global”  exact  controllability  in  the  global  control  case. 

In  the  local  control  case,  the  support  of  the  function  g  may  be  a  very  small  part  of 
the  domain  S';  thus  our  control  power  is  quite  limited.  In  this  situation,  we  have  the 
following  “local”  exact  controllability  result. 

Theorem  1.2.  Let  T  >  0  and  s  >  0  be  given.  Then  there  exists  a  S  >  0  such  that  for 
any  {<Po,iPo),(4>t,iPt)  e  HS+2(S)  x  Hs{S)  satisfying  (1.7)  and 

ll^olls+2  +  ||<£t||s+2  <  IIV'olls  +  IIV’tIIs  <  h, 
there  exists  a  control  function  h  €  L2(0,T;  H*(S)  such  that  the  equation 

Utt  uxx  —  (a(u)  uXx)xx  "b  Gh 

has  a  solution  u  E  C'([0,T];/fs+2(S'))  nC'1([0,T];/7s(5))  satisfying 
u(x,  0)  =  4>0(x),  ut{x,  0)  =  ip0(x), 
u(x,  T)  =  Ut{x,  T)  =  ipT{x). 

The  paper  is  organized  as  follows.  In  section  2,  we  consider  well-posedness  of 
the  initial  value  problem  of  the  forced  generalized  Boussinesq  equation  posed  on  the 
periodic  domain  S: 

{Wtt  ~  uxx  —  [a(u)  uxx)xx  +  /,  x  E  S,  t  E  Ft 
u(x ,  G)  =  4>(x),  ut(x,  0)  =  ip(x) 

As  it  is  known  this  is  equivalent  to  considering  periodic  solutions  of  the  equation  posed 
on  R.  The  local  well-posedness  of  this  problem  could  be  established  either  by  Kato’s 
semigroup  approach  (see  Bona  and  Sack  [1])  or  by  Bona  and  Smith’s  regularization 
approach  (see  Liu  and  Russell  [7]).  But  in  this  paper  we  provide  a  direct  and  simpler 
approach  with  the  contraction  principle .  The  advantage  of  this  approach  is  that  one 
not  only  obtains  the  well-posedness  of  the  problem  but  is  also  able  to  show  that  the 
solution  depends  analytically  on  its  initial  data  and  the  forcing  term.  It  should  be 
pointed  out  that  one  only  expects  a  local  well-posedness  result  for  the  IVP  (1.9). 
Some  solutions  of  the  IVP  (1.9)  may  blow  up  in  finite  time  even  though  their  initial 
data  and  the  forcing  term  are  smooth  (see  [5]).  In  section  3,  we  conduct  a  spectral 
analysis  of  the  operator 

0  1 

A-{d2x-di  0 

defined  in  the  space  Xs  =  Hs+2(S )  x  HS(S)  for  s  >  0.  We  show  that  the  operator  A 
is  a  discrete  spectral  operator  and  its  eigenvectors  form  a  Riesz  basis  of  the  space  Xs. 


Bing- Yu  Zhang 


301 


The  result  established  in  this  section  is  the  basis  to  obtain  our  main  exact  control¬ 
lability  results  for  the  generalized  Boussinesq  equation  (1.1).  The  proof  of  our  main 
results  of  this  paper.  Theorem  1  and  Theorem  2,  are  provided  in  section  4.  As  in  our 
earlier  joint  paper  with  Russell  [10]  which  dealt  with  the  same  control  problem  for 
the  Korteweg-de  Vries  equation,  we  consider  first  a  linear  system  associated  with  the 
nonlinear  system  (1.1): 

(Utt  Uxx  "b  'U'xxxx  Gh,  x  (E  S ^  t  (E  R 
u(x,  0)  =  ut( x,  0)  =  ip{x) 

We  will  show  that  the  system  is  exactly  controllable  in  the  space  Hs+2(S)  x  HS(S )  for 
any  s  >  0.  Moreover,  we  show  that  there  exists  a  bounded  linear  operator  KT  from 
the  initial/terminal  state  pair  ((f>o,ipo),  each  in  the  space  Hs+2(S)  x  HS(S), 

to  the  corresponding  control  h  in  the  space  L2([0,T];  HS(S)).  Then  the  proofs  of 
Theorem  1.1  and  Theorem  1.2  follow  from  the  same  argument  used  in  [10]. 


2.  Well-posedness 

In  this  section  we  establish  the  well-posedness  of  the  initial  value  problem  of  the 
forcing  general  Boussinesq  equation  on  a  periodic  domain  S, 

(  lift  =  UXx  (&(?/)  ~b  'U'xx'jxx  T  f  X  G  S,  t  £  R, 


[  u(x,  0)  =  <p(x),  ut{x,  0)  =  ip(x), 
via  the  contraction  principle  approach. 

Theorem  2.1.  Let  s  >  0  and  T  >  0  be  given.  Then  for  any  <p  6  Hs+2(S),  ip  £  Hs(S ) 
and  f  G  L^OjT;  HS(S)),  there  exists  a  T*  >  0,  depending  only  on  ||0||s+2,  HV’II  and 
||/l|i1(o.riff*(S))>  suc-h  that  the  IVP  (2.1)  has  a  unique  solution  u  €  C([0,  T*];  HS+2(S)) 
with  ut  G  C([0,T*];  HS{S)).  In  addition,  the  solution  depends  continuously  on  the 
initial  data  (j>,  ip  and  the  forcing  term  f  in  the  respective  spaces. 

Before  we  present  the  proof  of  the  theorem,  we  rewrite  (2.1)  as  the  following  equiv¬ 
alent  first  order  evolution  equation 


where 


-~-u  =  Au  +  F(u)  +  g,  u(  0)  =  uo 
dt 


0  1 
d2-di  0 


u)  F(i0  “  (  (o(V )  ■  *“(/)’  "0  = 

For  any  s  >  0  and  T  >  0,  let  As  denote  the  Hilbert  space 

=  HS+2(S )  x  HS(S) 
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equipped  with  the  norm 

NU.  =  (IMJ  +  II«**II2  +  IMI;f)1/2 

for  any  u  =  ^  ^  ^  £  Xs.  In  addition,  let 

Yt<s  =  C([0,T]-Xs). 

It  is  easy  to  see  that  the  operator  A  is  a  linear  operator  from  Xs  to  Xs  with 
V(A)  =  H*+4(S)  x  HS(S).  Besides,  the  adjoint  operator  of  A 

A*  =  -A. 

The  operator  A  generates  an  isomorphic  group  W  (t)  on  the  space  Xs  for  any  s  >  0 
and  the  standard  semigroup  theory  gives  us  the  following  estimates. 

Proposition  2.1.  Let  T  >  0  and  s  >  0  be  given.  There  exists  a  constant  c  >  0  such 
that 

(2.5)  sup  || W(t)u\\x,  <  Nk 

te(o,T) 

for  any  u  £  Xs  and 

(2.6)  sup  ||  f  W{t-T)f(T)dT\\xs  <  c||/|U‘(0,T;Xs) 

t€(0,T)  JO 

for  any  f£  L1(0,T;X2). 

Using  the  notations  we  just  introduced,  Theorem  2.1  can  be  restated  as  follows: 

Theorem  2.1'.  Let  s  >  0  and  T  >  0  be  given.  Then  for  any  u0  £  Xs  and  g  £ 
Ll(0,T;Xs),  there  exists  a  T*  >  0  depending  only  on  ||wolUs  and  ||<f||i,i(o,r;;fs)  such 
that  the  IVP  (2.2)  has  a  unique  solution  u  £  Yt*iS  and  the  corresponding  solution 
map:  ( u0,f )  £  Xs  x  L1{  0,T;X.)  YT\S  is  continuous. 

Proof  of  Theorem  2.1'.  Using  the  notation  of  the  semigroup  W (t)  we  may  write 
(2.2)  in  its  integral  form 

(2.7)  u(t )  =  W(t)uo  +  W(t  -  t)(F(u )  +  g)(r)dT 

It  suggests  us  to  consider  the  map  T  defined  on  the  space  U[0,  T ;  Jf,],  for  given 
u0  £  Xs  and  g  £  L1( 0, 1;  Xs),  by 

r(u)  =  W{t)u0  +  W(t-  t)(F(u)  +  g){r)dT 

for  any  u  £  Xs.  For  M  >  0  and  T  >  0,  let  St,m  be  a  bounded  subset  of  the  space 
C[0,T-,Xs]: 

St,m  =  lu  G  C[0,  T\  Xs\',  sup  |Hk<M  . 

I  te(o,T)  } 
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Applying  (2.5)-(2.6)  yields  that  for  any  T  >  0  there  exists  a  constant  c  >  0  such  that 

sup  ||«(i)|U.  <  c||uo|U.  +c  [  (II^Hk  +  \\9\\xs)dT. 
te(o,T)  Jo 

Note  that 

||-F(«)IU.  =  ||(a(«))Kt|U,  =  lbM)IU 

for  any  u  G  C[0,  T;  Xs]  and 

||(o(«))x*||.  —  lla  I|s  T  ||a  (u)uxx||s 

<  /d(||u||s+2)||u||s+2  <  P(u\\xs)\\u\\xs 

where  /?( •)  is  a  continuous  monotone  increasing  function  only  depending  on  a.  One 
has  that 

sup  ||r(fi)||x.  <  c(||u0||xs  +  II/IUho, JVC.))  +  cT/3{  sup  ||u(f))||x5  sup  ||€f(t)||x,. 

te(o,T)  t€(0,T)  te(o,T) 

Choose  M  and  T*  such  that 

(2.8)  M  =  2c(||uo||xs  +  II/IIlho.TjXs)})  cT*(5{2M)  <  1/2. 

Then 

sup  ||r(tt)||x.  <  M/ 2  +  cT*/3(M)M  <  M 
te(o,r*) 

for  any  u  G  St*,m-  I*1  addition,  for  any  u,  v  G  SM, 

r (u)  -  T(u)  =  I'  W(t  -  r)  (F(u)  -  F(H))  dr 
Jo 

and  i 

F(u)  -  F(v)  =  (  f  a' (it  +  \{v  -  u))d\(u  -  u)>)  . 

\J 0  J  xx 

Thus 

sup  ||r(u)  -  F(v)\\x,  <  cT*(3(  sup  ||u  +  if|Us)  sup  ||u-u||xs 
te(o,T*)  te(o,T’)  te(o,T) 

<  cT*f3(2M)  sup  ||u  — u||xs 

te(o,T*) 

<  sup  \\u-v\\xJ2. 
te(o,T*) 

So  F  is  a  contraction  in  Its  fixed  point  is  the  desired  solution.  □ 

As  an  advantage  of  this  contraction  principle  approach,  one  can  show  that  the  solu¬ 
tion  map  is  not  only  continuous  but  also  infinitely  many  times  Frechet  differentiable 
if  a  is  a  C°°  function  and  is  analytic  if  a  is  a  polynomial. 

Corollary  2.1.  If  the  function  a  is  C°°  smooth,  then  the  solution  map: 

(u0,  f)  G  Xs  x  L^O.T;  HS(S))  -»  yT,s 

is  C°°  smooth,  i.e.,  it  has  any  order  of  Frechet  derivatives.  Furthermore,  if  a  is  a 
polynomial,  then  the  solution  map  is  analytic. 

Proof,  see  [11]. 
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3.  Spectral  analysis 

In  this  section  we  conduct  a  spectral  analysis  of  the  operator  A  defined  by  (2.3). 
The  result  obtained  in  this  analysis  will  be  the  basis  to  transfer  the  exact  control¬ 
lability  problem  of  the  associated  linear  system  (1.10)  to  a  corresponding  moment 
problem. 


Let  us  define 


£i,o  — 


£2,0  — 


El'k  k 2  V  0 


Eo.k  = 


for  k  —  ±1,  ±2, ....  An  easy  computation  leads  to 
A(Eitk,E2ik)  =  (Ei<k,E2,k)^K  with  ■ 
The  matrix  has  the  eigenvalues 

Am  =  iJk2{k2  +  1) 


-( k 2  +  1)  0 


At, 2  =  -iJk2(k2  +  1) 


with  the  corresponding  eigenvectors 


^  ^k,  1  J  > 


ek,  2  = 

\  k? 


for  k  =  ±1,±2, _ In  addition,  A0,i  =  A0,2  =  0  is  also  an  eigenvalue  with  the 

corresponding  eigenvector 

-  -  (°) 
eo,i  —  eo, 2  — 


A(Eitk,  E2ik)(ekii,ekj2)  —  (Ei:k,E2jk)T,k(ekii,eki2) 

—  (Ai,A;(£li,fc,  E2,k)ek,\,  ^k,2(Eitk,  E2ik)ek>2). 

So  A^i  and  \k,2  are  eigenvalues  of  the  operator  A  with  the  corresponding  eigenvectors 
Vk,  1  —  (Ei,k,E2ik)ekti,  i)k,2  =  (Eiik,E2tk)ek,2- 

for  k  =  ±1,  ±2,  ....  In  addition,  A0  =  0  is  the  eigenvalue  of  the  operator  A  with  the 
corresponding  eigenvector 

*=(!)■ 


Note  that 


Mk  —  (ek:i,ek:2) 


Bing- Yu  Zhang 


305 


as  k  — >  oo.  Thus 


lim  det  M k  =  —2 i  ^  0. 

k—*oo 

Since  {Eltk,  E2jk},  fc  =  0,  ±1,  ±2,...  form  an  orthogonal  basis  for  the  space  Xs,  by 
[3]  we  know  that 

{Vo,  Vk,i,Vk,2,  k  =  ±  1,  ±2, . . . } 
forms  a  Riesz  basis  for  the  space  Xs. 


Note  that  A*  =  -A.  The  spectrum  of  A*  consists  of  the  eigenvalues  p,0  =  A0  =  0 
and 

—  A kj  Afcj 

for  j  =  1,2  and  k  =  ±1,  ±2, . . .  with  the  corresponding  eigenvectors  P0  =  f}0  and 
uk>j  =  ffkj.  Furthermore,  we  define 


mk,j  =  \\rjkj\\xa 

and 

$kj  =  Vk,j/mk,j 

for  j  =  1,2  and  k  =  ±1,  ±2, ....  Then 

[d>o  =  Vo,  4>k,j,J  =  1,2  and  k  =  ±1,  ±2, . . .  } 


forms  an  orthonormal  basis  for  the  space  Xs: 


(4>k,j,  4>m,i)xs  = 


1  if  k  =  m,  j  =  l 
0  otherwise 


The  above  discussion  can  be  summarized  as  the  following  theorem. 


Theorem  3.1.  Let 


j  iyjn2(n2  +  1)  n  =  0,1,2,..., 


i Pnt\ 


and 


[  —iyjn2{n2  +  1)  n  =  — 1,  — 2, ... 

^  0, 1, 2, . . . 

I  n  =  - 1,-2, -3,... 

f  n  =  0,1,2,... 

<Pn,2  =  S 


f  hat4>~n,2  n  =  —1,  -2, . . . 

Then 

(a)  The  spectrum  of  the  operator  A  consists  of  eigenvalues  {A„}^i_00  in  which 
A0  is  a  simple  eigenvalue  with  the  corresponding  eigenvector  cp o  =  <fro,i  =  <Po,2 
and  each  A„,  n  =  ±1,  ±2, ...  is  a  double  eigenvalue  with  the  corresponding 
eigenvector  </>raj-,  j  =  1,2. 
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(b)  {< fig ,  i f>nj,  j  =  1,2,  n  —  ±1,  ±2, . . .  }  forms  an  orthonormal  basis  for  the  space 
Xs  and  any  w  £  Xs  has  the  following  Fourier  series  expansion: 


W  —  ao(po  +  Y,  {otnp4>n,\  +  OLnfl<t>n,2) 
n=— oo,  7^0 


with  ao  =<w,  <f> o  >xs ,  and 

On,  1  =<  W,  <f>n>1  >xs,  Oin,2  =<  W,  f>n:2  >XS 

for  n  =  ±1,  ±2, . . . 


4.  Exact  controllability 

In  this  section  we  prove  the  main  results  of  this  paper.  Following  the  argument  in 
[10]  we  first  consider  the  corresponding  linear  control  system 

{U'tt  U'xx  ux xxx  Gh  x  £  S,  t  £  R , 

u(x,  0)  =  4>(x),  ut(x,  0)  =  ip(x), 

which  can  be  written  as  the  following  abstract  linear  system 

(  f  =  Ay  +  Bh, 


2/(0)  =  «7o 


on  the  space  Xs  where 


u  \  R,  /  0 
V  ~  (  u  J  ’  Bh  ~  Gh 


with  h  £  L2( 0,  T;  Hs(S)).  The  solution  y(t)  of  this  system  can  be  written  as 


y(t)  =  W(t)y0  +  [  W(t-  T)Bh(r)dT. 
Jo 


Based  on  the  spectral  analysis  of  the  operator  A  conducted  earlier, 

y(t)  =  eXota0(f>o  +  Y  +  atn,2<t>n,i) 

71^0 

(4.4)  ft  ,t 

+  eXo{t~T) (50(r)dT  +  Y  }  eAn(<-T)  (/3„,i  (r)(^n  ]  +  PnpiYPnfl)  dr 
Jo  &oJo 

where  a0  =  ( y0 ,  4>o)x„  Po  =  {Bh,  (p0)xs  and 

=  {l/Oi  4>n,j)  XB  )  Pn,j  =  {Bh,  <f>nlj')  Xs  {h,  B  <fin,j)  Hs{S) 

for  j  =  1,2  and  n  =  ±1,  ±2, . . .  Note  that 

{Bh,  (  “  W  =  ((  Gh)'  (l  W 


=  { h,Gv)H‘(s )■ 
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Thus 

(4.5)  H*(S) 

for  any  ^  G  Xs. 

Let 

Xs  =  {(j/o,  Vt)  G  x  Xs,  js(yo)idx  =  Jjyrhdx,  J^yo^dx  =  J^yrhdx  =  oj  . 

Here,  (y)j  denotes  the  j-th  component  of  y  (j  =  1,2).  We  show  that  the  linear 
system  (4.3)  is  exactly  controllable.  In  fact  we  show  that  the  following  stronger  result 
holds  which  is  the  key  for  obtaining  exact  controllability  of  the  nonlinear  system. 

Theorem  4.1.  Let  T  >  0  be  given.  There  exists  a  bounded  linear  operator 

Kt  :  Xs  — >  L2(0,T;  Hs(S)) 

such  that  for  any  (j/o,  j/r)  G  Xs,  the  solution  of 

y{t)  =  Ay(t)  +  BKT(y0 ,  yT ),  y{  0)  =  y0 

satisfies 

y{T)  =  Vt 
and 

Il^r(y0,  yT)||L2(0,T;H»(S))  <  CT  (l|yo||x3  +  ll$r||xs) 

for  some  constant  ct  >  0  independent  of  ( y0:  yr)- 

Proof.  Since  the  system  (4.3)  is  time  reversible  we  may  assume,  without  loss  of 
generality,  that  yr  =  0.  Then  the  exact  control  problem,  letting  t  =  T  in  (4.4)  and 
y(T )  =  0,  consists  of  finding  a  h£  L2(0,  T',HS(S))  such  that 

(cto  +  T/?o)<Ao  +  eXnT (an4>n,  1  +  0‘n,2<Pn,2) 

n^O 

+  /  eAn(T_T)  +  Pn,2 (t)</>„,2)  dr  =  0. 

This  leads  to  the  solvability  of  the  following  moment  problem 

[  _eA nTanii  =  /0T  eM^  pnA(T)dr) 

ao  +  T/3o  =  0,  < 

[  -eA"Tan,2  =  f0T  eMT-r)pn2(T)dT 

for  n  =  ±1,  ±2, . . .  Here  ( </>nj-)2  denotes  the  second  component  of  and  according 
to  the  previous  computation 

J"  if  i  =  1 

(<^nj)  2  =  \ 

(  6fce-ifcx  if  j  =  2. 

where  0  <  m  <  |6fc|  <  M  for  any  k. 
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By  denoting  pk  =  eXkt,  V  =  {pk  |  —  oc  <  k  <  00}  forms  a  Riesz  basis  for  its  closed 
span,  PT  in  L2(0,T)  (see  [4]).  We  let  £  =  {<&  |  —  00  <  k  <  00}  be  the  unique  dual 
Riesz  basis  for  V  in  PT,  i.e.,  the  functions  in  £  are  the  unique  elements  of  Pt  such 
that 

fT  _ 

(4.6)  jf  qi(t)pk(t)dt  =  6 ki,  -00  <1,  k  <00. 

We  take  the  control  h  in  (4.3)  to  have  the  form 


(4.7)  h(x,  t )  =  coq0(t)  +  ^  ®(^)(cf  iG((<^i)2)  +  Q,2G((<^,2)2) 

¥  0 

where  the  coefficients  c0,  Qp  and  q)2  are  to  be  determined  so  that,  among  other  things, 
the  series  (4.7)  is  appropriately  convergent.  Substituting  (4.7)  into  (4.4)  yields,  by 
using  the  biorthogonality  (4.6),  that  we  have  a0  =  Cq  <  G{<f> 0),  G((f>0)  >xB  and 


(4.8) 


!e  XnTaU:  1  =cnii(Gr(^„ii)2,  G((/>„ji)2)ijqs)+cTl)2(Gr(^njl)2,  G(^>„i2)2)jf»(s) 

e~XnTan>i  =  cnti(G(4>np)2,  G(^>n)i)2)fl"«(s)+crai2(G((/)ni2)2,  G(^n,2)2)ff»(5) 


for  n  =  ±1,  ±2, . . .  Let 

(G(<pn, 1)2,  G(0n!i)2)/rqs)  (G((j)nti)2G(4>n,2)2) H‘(S) 

A„  = 

(G(^n>2)2,  G{4>np)2)H°{s)  (G(^„,2)2G(lJ!>n>2)2)/f»(s,) 

—  ||G(0„,i)2||s||G(^„i2)2||s  -  |(G(<?in>i)2, G(<A„>2)jyS(5)|  . 

Note  that  An  ^  0  for  any  n  since  G(<^„i2)2  and  G(<^n,i)2  are  linearly  independent. 
In  addition,  as  n  ->  00,  (G(<^)2,  G^^)#^)  0  and  ||G(<?i„j)2||2  ~  6|,  j  =  1,2. 

Hence  there  exists  a  e  >  0  such  that 


|A„|  >  e 


for  any  n  and  cn,  1  and  cn>2  are  uniquely  determined  by  (4.8).  By  Cramer’s  rule, 


(4.9) 


^n,  1 


A„  1 


2 


A^ 

An 


with 


Anji  — 


e  XnTatni  1  (G(<^nji)2G(0„|2)2)i/s(S) 
e_AnTan>2  (G(«^>rji2)2G((/>rii2)2)ffs(s) 


and 


A„j2  — 


(G(^>„>i)2,G(<^n)i)2)Hs(s)  e  A"ra„,i 
(G(</>n,  2)2,  G(0nji)2)fp(s)  e~XnTaUt2 


ft  remains  to  show  that  h  defined  by  (4.7)  and  (4.9)  G  L2([0, T,];i7s(S').  It  follows 
from  the  same  argument  as  in  the  proof  of  Theorem  1.1  in  [10]  and  is  therefore 
omitted  here.  The  proof  is  complete.  □ 
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Now  we  turn  to  the  nonlinear  system 

(lilt  'U'XX  —  (^(u)  Hxx^xx  T  Gh 

u(x ,  0)  =  4>(x),  ut(x,  0)  =  i/j(x) 
and  prove  the  main  results  of  this  paper. 

Proof  of  Theorem  1.1.  According  to  Theorem  4.1,  there  exists  hi  G  L2(0,  T;  HS(S)) 
for  which  one  may  find  u  G  C([0,T];  HS+2(S))  fl  C^QO,!1];  HS(S))  satisfying 

Utt  'Uxx  +  0.xxxx  Ghi, 

<  u(x,  0)  =  4>o{x),  ut(x ,  0)  =  ipo{x), 

.  u(x,  T)  =  4>t{x),  Ut(x ,  T)  =  tpT(x) 

for  given  (^cV'o)  and  (<^r,  V’t)  in  the  space  Xs.  Adding  -a(u)xx  to  both  sides  of  the 
above  equation  one  obtains 

(  Utt  (n(u))3;3:  “t-  Uxxxx  Ghi  n('u)xx, 


•  u(x,  0)  =  <f>a(x),  ut{x,  0)  =  ipo{x), 

.  u(x,T)  =  (j>T{x ),  ut(x,T)  =  ipT(x)- 
Thus  it  suffices  to  show  that  there  exists  h2  €  C'([0,T];.H’S(5))  such  that 

(Gh2)(x,t)  =  ~{a(u))xx. 

Note  that  a(u)xx  =  a!(u)uxx  +  a"(u)u2x  G  C([0,T];  Hs(S))  since  a  is  a  smooth  func¬ 
tion  and  u  G  CQO,  T];  HS+2(S)).  The  existence  of  h2  follows  from  exactly  the  same 
argument  as  in  the  proof  of  [10,  Theorem  1.1].  The  proof  is  complete.  □ 

Proof  of  Theorem  1.2.  We  first  write  (4.10)  as  the  first  order  evolution  system 

(  ^  =  Au  +  F{u )  +  Bh, 

{  u( 0)  =  u0 

which  can  be  rewritten  in  its  equivalent  integral  equation  form 

(4.11)  u(t)  =  W(t)u0  +  j  W(t  —  t) (Bh((r )  +  jf  W(t  -  t)F{u){t)<It. 

We  define 

w(T,  u)  =  [  W(T  -  t)F{u)(t)<1t. 

Jo 

According  to  Theorem  4.1,  for  given  u0,  ut  G  Xs,  if  one  chooses 

h  =  Kt(uq ,  ut  +  cu(T,  u )) 

in  equation  (4.11),  then 

u(t)  =  W(t)u0+  (  W{t-T){BKT(uQ,uT  +  uj(T,u))(T)dT  +  f  W{t  -  t)F(u){t)<1t 
Jo  JO 
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and  u(0)  =  u0,  u(T)  =  uT  by  virtue  of  the  definition  of  the  operator  KT ■  This 
suggests  that  we  consider  the  map 

Y(u)  =  W(t)u0  +  f  W(t-T)(BKT(u0,uT  +  uj(T,u))(T)dT+  (  W(t  -  r)F({t)(r)dr. 
Jo  J  o 

If  the  map  F  is  shown  to  be  a  contraction  in  an  appropriate  space,  then  its  fixed 
point  u  is  a  solution  of  (4.11)  with  h  —  Kt{uo,  uj-  +  uj{T,  u)  and  satisfies  u(T)  =  Up- 
An  argument  similar  to  the  proof  of  Theorem  2.1  in  section  2  shows  that  the  map  Y 
is  a  contraction  in  the  space  Xs.  The  only  modification  one  needs  is  that  instead  of 
choosing  a  small  T,  one  chooses  a  small  <5  >  0  and  requires 

||mo||xs  <  <5,  II“t|U„  <  6 

so  that  the  map  F  becomes  a  contraction  (see  also  [11]).  The  proof  is  complete. 

□ 
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